
KEP: Extending Apiserver Network Proxy to handle traffic originated from Node network #2025

Merged
merged 1 commit into from Feb 9, 2021

Conversation

irozzo-1A
Contributor

@irozzo-1A irozzo-1A commented Sep 28, 2020

@k8s-ci-robot k8s-ci-robot added cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. needs-ok-to-test Indicates a PR that requires an org member to verify it is safe to test. labels Sep 28, 2020
@k8s-ci-robot
Contributor

Welcome @irozzo-1A!

It looks like this is your first PR to kubernetes/enhancements 🎉. Please refer to our pull request process documentation to help your PR have a smooth ride to approval.

You will be prompted by a bot to use commands during the review process. Do not be afraid to follow the prompts! It is okay to experiment. Here is the bot commands documentation.

You can also check if kubernetes/enhancements has its own contribution guidelines.

You may want to refer to our testing guide if you run into trouble with your tests not passing.

If you are having difficulty getting your pull request seen, please follow the recommended escalation practices. Also, for tips and tricks in the contribution process you may want to read the Kubernetes contributor cheat sheet. We want to make sure your contribution gets all the attention it needs!

Thank you, and welcome to Kubernetes. 😃

@k8s-ci-robot
Contributor

Hi @irozzo-1A. Thanks for your PR.

I'm waiting for a kubernetes member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@k8s-ci-robot k8s-ci-robot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. kind/kep Categorizes KEP tracking issues and PRs modifying the KEP directory sig/cloud-provider Categorizes an issue or PR as relevant to SIG Cloud Provider. labels Sep 28, 2020
@kikisdeliveryservice
Member

1.20 Enhancements Lead here 👋 does this KEP have an issue open? If not please open an issue in the Issues Tab and also add a link to the issue here.

Also, this KEP is using the older format that is missing the Production Readiness Review Questionnaire, etc... so if you could please update that would be awesome (see for ref https://github.com/kubernetes/enhancements/tree/master/keps/NNNN-kep-template)

@irozzo-1A
Contributor Author

1.20 Enhancements Lead here 👋 does this KEP have an issue open? If not please open an issue in the Issues Tab and also add a link to the issue here.

I don't have an issue open yet. I'll take care of it.

Also, this KEP is using the older format that is missing the Production Readiness Review Questionnaire, etc... so if you could please update that would be awesome (see for ref https://github.com/kubernetes/enhancements/tree/master/keps/NNNN-kep-template)

I will also do that, thx for the pointer ;-)

@bowei
Member

bowei commented Sep 30, 2020

/assign

@cheftako
Member

/cc @anfernee

@cheftako
Member

/ok-to-test

@k8s-ci-robot k8s-ci-robot added ok-to-test Indicates a non-member PR verified by an org member that is safe to test. and removed needs-ok-to-test Indicates a PR that requires an org member to verify it is safe to test. labels Sep 30, 2020
Member

@caesarxuchao caesarxuchao left a comment


@irozzo-1A thanks for starting this KEP! I asked for some clarifications.


## Motivation

The API server network proxy was originally introduced to allow running the cluster nodes on isolated networks, distinct from the one hosting the control plane components. It provides a way to handle traffic originating from the Kube API Server and going to the node networks. When using this setup, there is no option other than directly exposing the KAS to the Internet or setting up a VPN to handle traffic originating from the cluster nodes (i.e. Kubelet, pods). This could lead to security risks or complicated setups.
Member

This could lead to security risks or complicated setups.

Can you elaborate on why exposing the Konnectivity Proxy Server to the cluster network solves the "security risks or complicated setups"?

Contributor Author

Regarding the "security risks": I think that exposing the Konnectivity Proxy Server would add an additional layer of security, given that the channels with the proxy agents are secured with mTLS or token authentication.
The KAS is not directly exposed to the internet but is only accessible from "secured" networks. This would protect it, for instance, from KAS misconfigurations or vulnerabilities that could expose sensitive information
and/or access to unauthenticated users.

IMHO on a higher level, if we believe that exposing the Kubelet to the internet brings security risks, the same should hold for the KAS.

Contributor Author

Regarding the "complicated setups": You are right, we did not elaborate enough on this point, and "complicated" is not the appropriate term here. We will amend the proposal.
The point we want to make is that right now there is no standard solution for this kind of setup. It is possible to rely on VPNs, for example, to achieve a similar goal, but this requires specific implementations. What we propose here is to build on top of what we have and to provide a consistent approach for master-to-node and node-to-master communications.


* `--bind-address=ip`: Local IP address where the Konnectivity Agent will listen for incoming requests. It will be bound to a dummy interface with the IP x.y.z.v defined by the user. It must be used together with the previous flag to enable incoming requests; if not set, for backward compatibility, only traffic initiated from the Control Plane will be allowed.

### Handling the Traffic from the Pods to the Agent
Member

How does the agent authenticate the pods or the kubelet?

Contributor Author

It doesn't, it acts as a TCP forwarder without terminating TLS.


The agent listens for TCP connections at a specific port for each configured destination. When a connection request is received by the Konnectivity Agent the following happens:
1. A GRPC DIAL_REQ message is sent to the Konnectivity server containing the destination address associated with the current port.
Member

Has the proxy agent finished the TCP handshake with the client before step 1?

Contributor Author

Yes, this is what we had in mind. I can make this explicit if you think it's necessary.

I'm not sure it would be easy to send the DIAL_REQ at the SYN or SYN/ACK phase. How could we do it?
As far as I know, the TCPListener.Accept method returns the connection only once the TCP handshake is over.

BTW do you foresee any advantage in sending the DIAL_REQ before the TCP handshake is over?

### Agent additional flags

* `--target=local_port:dst_host_ip:dst_port`: We can have multiple of those in order to support multiple destinations on the Master Network.
Dst_host_ip: end target IP (apiserver or something else). In case of IPv6
Member

I'm curious what the "something else" is. Can you give some examples?

(If we want to support proxying to things other than the apiserver, we should remove the "apiserver" from the repository name :)

Member

Also how does the proxy server authenticate with the "something else"?

Contributor Author

@irozzo-1A irozzo-1A Oct 1, 2020

We don't have any use-case in mind at the moment. We could limit the scope to the KAS only.


### Deployment Model

The agent can run as a static pod or a systemd unit. In any case, the agent has to be started first, to give access to the KAS to the kubelet and, later, to the hosted pods. This means that using DaemonSets or Deployments is not an option in this setup, because the kubelet would not be able to get the pod manifests from the KAS.
Member

I'm not sure if running as a static pod will solve the kubelet bootstrapping issue. Kubelet needs access to the KAS to watch for secrets/configmaps, and to register the node. I don't know if kubelet can bootstrap in this order:

  1. run the proxy agent as a static pod, waiting for it to establish tunnels to the KAS,
  2. start watching for secrets/configmaps/pods and register the node.

Also I'm not sure if other node components like node-problem-detector are ok with this bootstrap order.

I think @cheftako knows this better.

Contributor Author

Definitely the proxy agent is required to be up and running in order for the kubelet to perform its bootstrap sequence. I was expecting the static pods to be running without the kubelet being registered, but I did not test it and I agree we should double check if this is feasible.


### Authentication

The Konnectivity agent currently supports mTLS or token-based authentication. Note that API objects such as Secrets cannot be accessed when either a static pod or systemd service deployment strategy is used, so the authentication secret has to be made available to the agent through a different channel (e.g. provisioned in the worker node file-system).
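To make the out-of-band credential provisioning concrete, here is a minimal Go sketch of how an agent could build mTLS transport credentials from files placed on the worker node file-system. The file paths and the `dialServer` helper are illustrative assumptions, not the actual Konnectivity agent code or flags.

```go
// Minimal sketch (not the actual agent code): load a client certificate and CA
// bundle provisioned on the worker node file-system and build gRPC transport
// credentials for the agent -> Konnectivity Server channel. Paths are
// illustrative assumptions.
package agentauth

import (
	"crypto/tls"
	"crypto/x509"
	"fmt"
	"os"

	"google.golang.org/grpc"
	"google.golang.org/grpc/credentials"
)

func dialServer(serverAddr string) (*grpc.ClientConn, error) {
	cert, err := tls.LoadX509KeyPair(
		"/etc/konnectivity/agent.crt", // provisioned out of band, not via a Secret
		"/etc/konnectivity/agent.key",
	)
	if err != nil {
		return nil, fmt.Errorf("loading client keypair: %w", err)
	}
	caPEM, err := os.ReadFile("/etc/konnectivity/ca.crt")
	if err != nil {
		return nil, fmt.Errorf("reading CA bundle: %w", err)
	}
	pool := x509.NewCertPool()
	if !pool.AppendCertsFromPEM(caPEM) {
		return nil, fmt.Errorf("no CA certificates found")
	}
	creds := credentials.NewTLS(&tls.Config{
		Certificates: []tls.Certificate{cert}, // client certificate for mTLS
		RootCAs:      pool,                    // trust anchor for the proxy server
	})
	return grpc.Dial(serverAddr, grpc.WithTransportCredentials(creds))
}
```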
Member

This is talking about the authentication between the proxy agent and the proxy server, right? Can you point this out in the KEP?

Contributor Author

Yes you are right, we'll make it clear.

@@ -0,0 +1,194 @@
---
title: Out-of-Tree Credential Providers
Member

Copy paste error I'm guessing? :P

Contributor Author

Yep, thx for pointing out.


## Summary

The goal of this proposal is to allow traffic to flow from the Node Network to the Master Network.
Member

"Cluster network to the Control Plane network"

Contributor Author

We initially used that terminology, but we changed it later to be consistent with the definitions below. Of course we are open to change if you think it makes it more clear.


As mentioned above, pods make use of the Kubernetes default service to reach the KAS. To keep things transparent from a Pod perspective, they will hit the Konnectivity Agent using the Kubernetes default service. The endpoint will be the Konnectivity Agent instead of the KAS.
The Kubernetes default service is configured on the Control Plane side using the API server flag `--advertise-address ip`.
The `--advertise-address ip` should match the `--bind-address ip` of the Konnectivity Agent described above.
Member

Would the serving port used in the default kubernetes Service be updated as well? If so, would that imply that the kube-apiserver and the Konnectivity Agent listen on the same port?

Contributor Author

@irozzo-1A irozzo-1A Oct 1, 2020

Yes, the Agent should listen on the secure port used by the KAS. I think we should make this clearer. Thx for the hint.


### Handling the Traffic from the Pods to the Agent

As mentioned above, pods make use of the Kubernetes default service to reach the KAS. To keep things transparent from a Pod perspective, they will hit the Konnectivity Agent using the Kubernetes default service. The endpoint will be the Konnectivity Agent instead of the KAS.
Member

Do you mean the cluster-IP service (something like 10.96.0.1)? Does that mean kube-proxy has a dependency on Konnectivity? It looks like it doesn't, but I want to make sure this is accurate.


Yes, that is correct.
Kube-proxy has no dependency on the Konnectivity agent. As we mentioned below, when configuring the KAS it is necessary to make sure that the IP used by the Agent is the same as the one advertised by the KAS.


Currently the Konnectivity Server is accepting requests from the KAS either with the gRPC or the HTTP Connect interfaces and is taking care of forwarding the traffic to the Konnectivity Agent using the previously established connections (initiated by the Agents).

In order to enable traffic from Kubelets and Pods running on the Node Network, the Konnectivity Agents have to expose an endpoint that listens on a specific port for each of the destinations on the Master Network. As opposed to the traffic flowing from the Master Network to the Node Network, the Konnectivity Agent should act transparently: from a Kubelet's or Pod's standpoint, the Konnectivity Agent should be the final destination rather than a proxy.
Member

It's a little bit confusing. It's still a proxy that forwards the traffic to KAS, right?


Yes, you can see this as the equivalent of SSH remote port forwarding. The client is not aware of interacting with a proxy; from its standpoint, it is sending the request to its final destination.

The agent listens for TCP connections at a specific port for each configured destination. When a connection request is received by the Konnectivity Agent the following happens:
1. A GRPC DIAL_REQ message is sent to the Konnectivity server containing the destination address associated with the current port.
2. Upon reception of the DIAL_REQ the Konnectivity Server opens a TCP connection with the destination host/port and replies to the Konnectivity Agent with a GRPC DIAL_RES message.
3. At this point the tunnel is established and data is piped through it, carried over GRPC DATA packets.
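To make the three steps above concrete, here is a minimal Go sketch of the agent-side forwarding loop. The `Tunnel` interface is a hypothetical stand-in for the Konnectivity gRPC client that performs the DIAL_REQ/DIAL_RES exchange and carries DATA packets; it is not the real konnectivity-client API.

```go
// Sketch of the agent-side loop described above. Tunnel is a hypothetical
// stand-in for the Konnectivity gRPC client: Dial is assumed to send DIAL_REQ,
// wait for DIAL_RES, and return a net.Conn whose bytes travel as DATA packets.
package agent

import (
	"io"
	"log"
	"net"
)

type Tunnel interface {
	Dial(destination string) (net.Conn, error)
}

// forward accepts local TCP connections and pipes each one through the tunnel
// to the configured destination on the Master Network.
func forward(t Tunnel, localAddr, destination string) error {
	ln, err := net.Listen("tcp", localAddr)
	if err != nil {
		return err
	}
	for {
		client, err := ln.Accept() // TCP handshake with the client completes here, before step 1
		if err != nil {
			return err
		}
		go func(client net.Conn) {
			defer client.Close()
			backend, err := t.Dial(destination) // steps 1-2: DIAL_REQ / DIAL_RES
			if err != nil {
				log.Printf("dial %s via tunnel: %v", destination, err)
				return
			}
			defer backend.Close()
			go io.Copy(backend, client) // step 3: client -> KAS, carried as DATA packets
			io.Copy(client, backend)    // step 3: KAS -> client
		}(client)
	}
}
```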
Member

I guess this has a big performance impact. Because the other end is always KAS:6443, it seems a reverse proxy on the Konnectivity server side could do the same job without the agent.


Indeed, we have the same performance penalty that we are already paying for traffic going from the Konnectivity server to the nodes.
With a reverse proxy, we would not have the additional layer of security provided by the authentication between the Konnectivity Agent and the Konnectivity Server, and we would also lose the ability to use SNI for load balancing.

Member

I guess a whitelist is necessary for this sort of proxy in the reverse direction. The master network has access to everything in the node network, but I am not sure the reverse should be true.

Member

+1 to having an explicit allow list on the Konnectivity Server which controls where it will allow traffic to be sent on the control plane.


@anfernee @cheftako
Indeed, that is correct and we have taken this into account in the Risks and Mitigations / Allow list section.

### Traffic Flow

```
client =TCP=> (:6443) agent GRPC=> server =TCP=> KAS(:6443)
```
Member

Since the control plane and cluster networks are disjoint, can you elaborate on how the agent -> server tunnel is established? Since the agent shares the same network as other pods on the cluster network, can other pods (e.g. kubelet) not directly tunnel to the konnectivity server as well?


It is established by exposing the Konnectivity server. The only requirement is that the Agent must be able to route traffic to the Konnectivity Server (equivalent to what is required today).

Regarding the second question: as we don't have control over the clients reaching the KAS (apart from the kubelet), we cannot force them to establish tunnels with the Konnectivity Server.

### Handling the Traffic from the Kubelet to the Agent

The Kubelet does not use the Kubernetes default service to reach the KAS. Instead, it relies on a bootstrap kubeconfig file to connect to the KAS, and then generates a proper kubeconfig file that uses the same URL.
Instead of specifying the KAS FQDN/address in the bootstrap kubeconfig file, we will use the local IP address of the Konnectivity agent (`--bind-address ip`).
Member

Are pods only able to communicate to the agent on the same machine?


Yes, because it will bind to a local scoped interface.

@k8s-ci-robot k8s-ci-robot added size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. and removed size/L Denotes a PR that changes 100-499 lines, ignoring generated files. labels Oct 16, 2020
@youssefazrak

@cheftako we have addressed the comments. Would be great to get another review :)

@huxiaoliang

@cheftako what is the status of this thread? Do you plan to put it in scope for a release? Thanks in advance.

@andrewsykim andrewsykim added this to the v1.21 milestone Jan 20, 2021
As mentioned before, we will be using the Kubernetes default service to route traffic to the agent. The service itself has a couple of limitations: it can't be of type ExternalName, which prevents the usage of DNS names. Some general Service limitations also apply: endpoints can't use the link-local or the localhost range. This means that we are left with the three private IP ranges (10.0.0.0/8, 172.16.0.0/12 and 192.168.0.0/16).

The agent will create a dummy interface, assign it the IP provided with the `bind-address` flag using `host` scope, and start listening on this IP:local_port (local_port is defined with the `target` flag). This allows all agents to bind to the IP address advertised by the KAS, which will be valid only inside the node.
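As an illustration of the dummy-interface mechanism described above, here is a rough Go sketch assuming the github.com/vishvananda/netlink library; the interface name, address, and `bindLocalAddress` helper are illustrative assumptions rather than the agent's actual implementation.

```go
// Sketch of the dummy-interface setup described above, assuming the
// github.com/vishvananda/netlink library. The interface name and address are
// illustrative; the real agent flags decide the actual values.
package agent

import (
	"fmt"
	"net"

	"github.com/vishvananda/netlink"
)

// bindLocalAddress creates a dummy link, assigns the advertised KAS IP to it
// with host scope (valid only inside the node), and returns a TCP listener on
// ip:port for the agent's forwarding loop.
func bindLocalAddress(ip string, port int) (net.Listener, error) {
	link := &netlink.Dummy{LinkAttrs: netlink.LinkAttrs{Name: "konnectivity0"}}
	if err := netlink.LinkAdd(link); err != nil {
		return nil, fmt.Errorf("creating dummy interface: %w", err)
	}
	addr, err := netlink.ParseAddr(ip + "/32")
	if err != nil {
		return nil, err
	}
	addr.Scope = int(netlink.SCOPE_HOST) // host scope: not routable off the node
	if err := netlink.AddrAdd(link, addr); err != nil {
		return nil, fmt.Errorf("assigning %s: %w", ip, err)
	}
	if err := netlink.LinkSetUp(link); err != nil {
		return nil, err
	}
	return net.Listen("tcp", fmt.Sprintf("%s:%d", ip, port))
}
```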
Contributor Author

We thought about a possible shortcoming with this approach. The idea is to redirect the KAS traffic generated by PODs on the node network to the Konnectivity agents, by setting the IP used by the agents via the `advertise-address` flag of the KAS. This IP will be set in the Endpoints of the kubernetes service in the default namespace, which is used by k8s clients.

On the other hand, this could be limiting if PODs deployed on the master network rely on the kubernetes default service as well (e.g. CNI pods). As we won't have agents deployed on master nodes, this IP will be unreachable. To circumvent this, we could use the IP address of a load-balancer targeting the KAS instances in an HA setup.

Alternatively, we could consider another approach: the agent could use iptables and "dnat" the traffic going to the Kubernetes default service to the local agent. In this case, the configuration would probably be easier, but the reason we initially discarded this option is that the implementation is more complex, as we would need to take care of possible conflicts with other rules (e.g. kube-proxy in iptables mode); a rough sketch follows below.

@cheftako @anfernee @timoreimann any thoughts about this?
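For comparison, here is a rough sketch of the iptables/DNAT alternative mentioned above, assuming the github.com/coreos/go-iptables library. The chain choice, addresses, and `redirectDefaultService` helper are assumptions for illustration only; handling conflicts with kube-proxy rules is exactly the complexity the comment refers to and is not addressed here.

```go
// Rough sketch of the iptables/DNAT alternative mentioned above, assuming the
// github.com/coreos/go-iptables library. The chain, service IP, and agent
// address are assumptions; interaction with kube-proxy rules is not handled.
package agent

import (
	"github.com/coreos/go-iptables/iptables"
)

// redirectDefaultService rewrites traffic destined to the kubernetes default
// service (e.g. 10.96.0.1:443) so that it lands on the local Konnectivity
// agent instead, without touching the Service definition itself.
func redirectDefaultService(serviceIP, servicePort, agentAddr string) error {
	ipt, err := iptables.New()
	if err != nil {
		return err
	}
	rule := []string{
		"-p", "tcp",
		"-d", serviceIP,
		"--dport", servicePort,
		"-j", "DNAT",
		"--to-destination", agentAddr, // e.g. "100.64.0.2:6443", purely illustrative
	}
	// OUTPUT covers locally generated traffic; pod traffic would also need a
	// PREROUTING rule, and conflicts with kube-proxy rules would have to be
	// resolved, which is why this option was initially discarded as complex.
	return ipt.AppendUnique("nat", "OUTPUT", rule...)
}
```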


Not sure if I understand the load-balancer scenario correctly.
If you set the KAS LB IP via advertise-address to make it the endpoint of the kubernetes svc, you wouldn't need the agents in the first place, would you?

On the LB side there's another potential downside for providers that try to host multiple KAS instances from different clusters behind one LB. There's no way to differentiate to which cluster the traffic belongs. SNI information will yield kubernetes for all. The SNI info from the agent connections makes that possible.

Just using the load-balancer approach on the master could also lead to a problem. IIRC, on some cloud providers, services behind the LB cannot directly talk to the external LB IP. You'd need to "dnat" this again on the master node, which we could do anyway (also in other scenarios).

Personally I'd be fine to not be able to use the kubernetes svc on the master and be forced to configure the services with the k8s endpoint.

With regards to network ranges that we could use: Shouldn't we also be able to make use of the "carrier grade NAT" range (100.64.0.0 - 100.127.255.255). I'm assuming this has less chance of conflicting with any existing cluster ranges.

Contributor Author

@irozzo-1A irozzo-1A Jan 27, 2021

Hi @gottwald, thanks for reacting!

If you set the KAS LB IP via advertise-address to make it the endpoint of the kubernetes svc, you wouldn't need the agents in the first place, would you?

I think I did not express my idea clearly enough. What I meant is that we could use a private LB with an IP that is not routable from the node network and that is used from within the master network only, to allow the PODs to reach the KAS via the default kubernetes service. On the node network, things would be unchanged.

Personally I'd be fine to not be able to use the kubernetes svc on the master and be forced to configure the services with the k8s endpoint.

If no one has any objection, I would be fine to start with this limitation too.

With regards to network ranges that we could use: Shouldn't we also be able to make use of the "carrier grade NAT" range (100.64.0.0 - 100.127.255.255). I'm assuming this has less chance of conflicting with any existing cluster ranges.

Thx for the hint, I did not know about this range. Using the link-local range would probably be the safest/cleanest solution, but as we cannot because of the endpoints limitations, I have nothing against mentioning this range too. @youssefazrak any thoughts about this?


Nothing against it, and actually, that's a good idea.
The range is private to the cluster, and supposing a host connects from a CGN network, there will be no routing conflict as it is NATed to another range.

@@ -0,0 +1,694 @@
<!--
**Note:** When your KEP is complete, all of these comment blocks should be removed.
Member

Seems like the metadata has been moved to kep.yaml; it would be nice to get rid of these big comment blocks.

-->
# KEP-2025: Extending Apiserver Network Proxy to handle traffic originated from Node network

<!--
Member

Please 😸

[documentation style guide]: https://github.com/kubernetes/community/blob/master/contributors/guide/style-guide.md
-->

The goal of this proposal is to allow traffic to flow from the Node Network to the Master Network.
Member

Can we elaborate here? In many environments this already works, so can we attempt a description of when you would need this solution?

Member

"The goal of this proposal is to provide a mechanism which allows traffic to flow from the Node Network to the Master Network, when those networks are otherwise isolated and there is a desire not to expose the Kubernetes API Server publicly"?

Contributor Author

yes indeed, that's more explicit

List the specific goals of the KEP. What is it trying to achieve? How will we
know that this has succeeded?
-->
* Handle requests from the nodes to the control plane. Enable communication from the Node Network to the Master Network without having to expose the KAS to the Node Network.
Member

Is it just Nodes or is it any KAS client running in the Node Network? (So operators and the like)

Contributor Author

It's any KAS client running on the node network.

-->
* Define a mechanism for exchanging authentication information used for establishing the secure channels between agents and server (e.g. certificates, tokens).
* Define a solution involving less than one agent per node.
* Being able to reach arbitrary destinations on the master network, this could be considered in the future if some use-cases arise.
Member

For now can we restrict it to just the KAS? (i.e. non-goal to talk to anything other than the KAS on the master network)

-->
Currently the Konnectivity Server is accepting requests from the KAS either with the gRPC or the HTTP Connect interfaces and is taking care of forwarding the traffic to the Konnectivity Agent using the previously established connections (initiated by the agents).

In order to enable traffic from Kubelets and pods running on the Node Network, the Konnectivity Agents have to expose an endpoint that listens on a specific port and forwards the traffic to the KAS on the Master Network. As opposed to the traffic flowing from the Master Network to the Node Network, the Konnectivity Agent should act transparently: from a Kubelet's or pod's standpoint, the Konnectivity Agent should be the final destination rather than a proxy.
Member

Anything capable of sending to that port will be able to send traffic to the KAS. We may want to think about options like listening on localhost or firewalling that port off from non-node-network traffic.

Contributor Author

Yeah, we plan to use a host-scope address, so that it will be possible to use it only from within the host itself.

### Traffic Flow

```
client =TCP=> (:6443) agent GRPC=> server =TCP=> KAS(:6443)
```
Member

I'm fine with port 6443 being the default but I would suggest that be configurable.

Member

In a HA setup there could be multiple servers that the agent is connected to. Do we have a reason to care where the traffic goes? (Eg. matching failure zone?) Or do we just pick a random server?

Contributor Author

It is configurable, actually; I put 6443 here just as an example, as it is the default used by the KAS (if no secure-port flag is specified).

Contributor Author

In a HA setup there could be multiple servers that the agent is connected to. Do we have a reason to care where the traffic goes? (Eg. matching failure zone?) Or do we just pick a random server?

@cheftako I would say that we pick a random server, but we can think about evolving this later if the need arises.


* `--allowed-destination=dst_host:dst_port`: The address and port of the KAS.

Note: if this feature will be extended to allow reaching arbitrary destinations in the master network, this can be easily generalized by allowing multiple occurrences of this flag and maintaining a list of allowed destinations.
Member

Can we explicitly specify that if this flag is absent, the server will not allow any node-network-initiated requests (traffic) to be placed on the master network?
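A minimal sketch of what such a check could look like on the Konnectivity Server side, with destinations collected from repeated `--allowed-destination` flags and a deny-by-default behaviour when the flag is absent, as suggested above. The names and flag wiring are assumptions, not the actual proxy server implementation.

```go
// Sketch of a server-side allow list for node-network-initiated dials, based
// on the --allowed-destination flag described in the KEP. The deny-by-default
// behaviour when no flag is given follows the suggestion above; this is not
// the actual proxy server implementation.
package server

import "fmt"

type allowList map[string]struct{} // keys are "host:port" destinations

func newAllowList(flags []string) allowList {
	a := allowList{}
	for _, dst := range flags { // each value of a repeated --allowed-destination flag
		a[dst] = struct{}{}
	}
	return a
}

// checkDial rejects every DIAL_REQ coming from an agent unless its destination
// was explicitly allowed; with no flags set, all such traffic is refused.
func (a allowList) checkDial(destination string) error {
	if _, ok := a[destination]; !ok {
		return fmt.Errorf("destination %q is not in the allowed destinations list", destination)
	}
	return nil
}
```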

@k8s-ci-robot k8s-ci-robot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. and removed size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. labels Feb 8, 2021
owning-sig: sig-cloud-provider
participating-sigs:
- sig-network
status: provisional
Contributor

Hi there, 1.21 Enhancements Lead here.

Please make sure to change the status to implementable to meet one of the requirements for all KEP tracking for the release.

Contributor Author

Hi @annajung, it's done.

@dims
Member

dims commented Feb 9, 2021

/assign @johnbelamaric


* **Can the feature be disabled once it has been enabled (i.e. can we roll back
the enablement)?**
Yes, it can be disabled by simply changing the KAS `--advertise-address` and
Contributor

@deads2k deads2k Feb 9, 2021

Seems like you'd need to update the workers to remove the konnectivity agent that's configured locally.

Contributor Author

Indeed @deads2k, the agents should be removed. I'll add this step.

* **What are the SLIs (Service Level Indicators) an operator can use to determine
the health of the service?**
- [ ] Metrics
- Metric name:
Contributor

This isn't required for alpha, but if/when you move to beta, I'd like to see metrics from the konnectivity agent.

@deads2k
Contributor

deads2k commented Feb 9, 2021

the PRR looks good for alpha. Please keep the comment about metrics in mind for beta.

/approve

@cheftako
Member

cheftako commented Feb 9, 2021

/lgtm
/approve

@k8s-ci-robot k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Feb 9, 2021
@k8s-ci-robot
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: cheftako, deads2k, irozzo-1A

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@k8s-ci-robot k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Feb 9, 2021
@k8s-ci-robot k8s-ci-robot merged commit 6576376 into kubernetes:master Feb 9, 2021
@irozzo-1A irozzo-1A deleted the extend-konnectivity branch February 9, 2021 23:21