Endpoints api object needs change to allow useful merge patching #47787

shyamjvs · 2017-06-20T13:56:03Z

Problem: Endpoints object currently is unsuitable for doing any reasonable PATCH operation (i.e. at the level of individual list entries, because a PATCH on the entire list is as good as a PUT of the endpoints object and hence useless).

Reason: There is no way of key'ing individual entries in the endpoints list currently because we do not have one entry for each <IP, Port> pair but rather have one entry for all the IPs having the same set of ports. This is represented as a cartesian product:

#### trimmed down for brevity ####
type Endpoints struct {
	Subsets []EndpointSubset `json:"subsets" protobuf:"bytes,2,rep,name=subsets"`
}

// For eg.
//   {
//     Addresses: [{"ip": "10.10.1.1"}, {"ip": "10.10.2.2"}],
//     Ports:     [{"name": "a", "port": 8675}, {"name": "b", "port": 309}]
//   }
type EndpointSubset struct {
	Addresses []EndpointAddress `json:"addresses,omitempty" protobuf:"bytes,1,rep,name=addresses"`
	NotReadyAddresses []EndpointAddress `json:"notReadyAddresses,omitempty" protobuf:"bytes,2,rep,name=notReadyAddresses"`
	Ports []EndpointPort `json:"ports,omitempty" protobuf:"bytes,3,rep,name=ports"`
}

We need to key the EndpointSubset struct somehow.
Also, the compaction of size => some extra computation for the patch object (but I guess we are fine with it?)

cc @kubernetes/sig-api-machinery-misc @kubernetes/sig-scalability-misc @smarterclayton @gmarek

The text was updated successfully, but these errors were encountered:

shyamjvs · 2017-06-20T14:05:19Z

This can help a lot in the performance of the endpoint controller which is currently using PUTs instead of PATCHs. Related issue: #47597

shyamjvs · 2017-06-20T14:26:26Z

Maybe we can use the 'Addresses' list itself as the key but I'm not sure if that's going to be unique (though I'd hope so) and if it's fine to have a list as a key.
Moreover, seems like some previous discussions favoured using maps with keys for objects than having lists (ref: #4889) in such cases.
@smarterclayton WDYT?

liggitt · 2017-06-20T18:28:03Z

without contention, patch is actually more expensive both client-side (because patch computation is required) and server-side (because patch application and conflict-detection is required).

do we have contention on writing endpoints? I expected the endpoints controller to largely be the only writer

liggitt · 2017-06-20T18:28:51Z

I thought the main issue was the extremely short resync interval, not write conflicts

smarterclayton · 2017-06-21T00:02:42Z

I suggested PATCH to do blind overwrite, which avoids having to round trip between client and server, which reduces latency, which means the endpoints controller clears work faster, and moves more CPU to the master, where there is less latency to etcd.

gmarek · 2017-06-21T10:12:08Z

I agree with @smarterclayton - we want to avoid unnecessary GETs. The issue here is that we'd need to do some API changes to accomplish that. The question here is - are we doing things like this?

shyamjvs · 2017-06-21T12:59:58Z

cc @lavalamp @wojtek-t

liggitt · 2017-06-21T13:02:26Z

The issue here is that we'd need to do some API changes to accomplish that.

does a json merge patch that overwrites the entire subsets list not work as expected?

shyamjvs · 2017-06-21T13:13:28Z

@liggitt We are currently PUT'ing the endpoints object which IIUC is as good as merging the entire subsets list (as that's pretty much the whole endpoints object). This is needing us to send huge endpoints objects over the wire each time. Also (as @smarterclayton pointed out), this could be bad if just a single endpoint is flapping states.

shyamjvs · 2017-06-21T17:47:33Z

Seems like it's not possible to merge patch individual elements of arrays as per RFC 7386 (the one that our jsonpatch vendored package follows). Quoting from the document:
Also, it is not possible to patch part of a target that is not an object, such as to replace just some if the values in an array.

However IIUC we've solved this problem in k8s by using some field of the array elements' struct as the patchMergeKey, so individual array elements can be keyed on that field.
This means adding a 'key' field to the EndpointSubset struct should work. Does this make sense @smarterclayton / @liggitt ?

liggitt · 2017-06-21T17:50:35Z

This means adding a 'key' field to the EndpointSubset struct should work.

I think that would be a breaking API change

shyamjvs · 2017-06-21T18:27:04Z

That's true. However if we want to be able to merge patch, we need the elements of the array to be somehow keyed.

Another option I could think of was to use the index of the array element as it's key. This won't need any API change, but it's not a very clean way. Because a patch operation like "update element at index 5" might end up reaching after some other element got inserted there. What's worse is it won't be detected as a conflict and you'll end up wrongly updating unless there's some other safety mechanism.
Could there be some better option or we just live without patches? :)

shyamjvs · 2017-06-28T22:55:11Z

Closing this issue as it is infeasible without breaking API change. Let's reopen it in future if need be.

smarterclayton · 2017-06-28T23:53:32Z

Why bother with "patch individual address"? I was thinking more of "send all of subsets as one patch".

…

On Wed, Jun 28, 2017 at 6:55 PM, Shyam JVS ***@***.***> wrote: Closed #47787 <#47787>. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#47787 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/ABG_p88bfmhfdw3dGO2W6h3X6Iljtcpuks5sItnZgaJpZM4N_nw7> .

gmarek · 2017-06-29T06:49:55Z

Isn't that pretty much equivalent to normal update? There's not much data except those addresses.

shyamjvs · 2017-06-29T09:59:37Z

@smarterclayton Yes, the reason for starting the issue was to do smarter patching than patching the whole list (which is basically the entire ep object). Unless there are some other benefits of patching I might be missing?

smarterclayton · 2017-06-29T14:39:12Z

Conflict detection is done on the server, so round trip latency is lower and CPU used by deserialization is lower (mostly). Every round trip to the API server is expensive because it involves 4 more serialize/deserialize operations

…

On Thu, Jun 29, 2017 at 5:59 AM, Shyam JVS ***@***.***> wrote: @smarterclayton <https://github.com/smarterclayton> Yes, the reason for starting the issue was to do smarter patching than patching the whole list (which is basically the entire ep object). Unless there are some other benefits of patching I might be missing? — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#47787 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/ABG_p5vHRR3crIvkw878zUCr6ctSv_Tvks5sI3WcgaJpZM4N_nw7> .

gmarek · 2017-06-29T15:35:26Z

OK, I clearly don't understand what you mean by "send all of subsets as one patch". Isn't it pretty much equivalent to plain update (assuming that metadata didn't change), except that instead of using protobuf we'd use JSON?

shyamjvs added area/api Indicates an issue on api area. area/controller-manager kind/api-change Categorizes issue or PR as related to adding, removing, or otherwise changing an API labels Jun 20, 2017

k8s-ci-robot added sig/api-machinery Categorizes an issue or PR as relevant to SIG API Machinery. sig/scalability Categorizes an issue or PR as relevant to SIG Scalability. labels Jun 20, 2017

shyamjvs mentioned this issue Jun 28, 2017

Endpoint removal delayed when pod is deleted (1.6) #47597

Closed

shyamjvs closed this as completed Jun 28, 2017

thockin mentioned this issue Jul 6, 2018

Catalog of known scalability issues with kubernetes services kubernetes/community#1984

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Endpoints api object needs change to allow useful merge patching #47787

Endpoints api object needs change to allow useful merge patching #47787

shyamjvs commented Jun 20, 2017

shyamjvs commented Jun 20, 2017

shyamjvs commented Jun 20, 2017

liggitt commented Jun 20, 2017

liggitt commented Jun 20, 2017

smarterclayton commented Jun 21, 2017

gmarek commented Jun 21, 2017

shyamjvs commented Jun 21, 2017

liggitt commented Jun 21, 2017

shyamjvs commented Jun 21, 2017

shyamjvs commented Jun 21, 2017 •

edited

Loading

liggitt commented Jun 21, 2017

shyamjvs commented Jun 21, 2017 •

edited

Loading

shyamjvs commented Jun 28, 2017

smarterclayton commented Jun 28, 2017 via email

gmarek commented Jun 29, 2017

shyamjvs commented Jun 29, 2017

smarterclayton commented Jun 29, 2017 via email

gmarek commented Jun 29, 2017

Endpoints api object needs change to allow useful merge patching #47787

Endpoints api object needs change to allow useful merge patching #47787

Comments

shyamjvs commented Jun 20, 2017

shyamjvs commented Jun 20, 2017

shyamjvs commented Jun 20, 2017

liggitt commented Jun 20, 2017

liggitt commented Jun 20, 2017

smarterclayton commented Jun 21, 2017

gmarek commented Jun 21, 2017

shyamjvs commented Jun 21, 2017

liggitt commented Jun 21, 2017

shyamjvs commented Jun 21, 2017

shyamjvs commented Jun 21, 2017 • edited Loading

liggitt commented Jun 21, 2017

shyamjvs commented Jun 21, 2017 • edited Loading

shyamjvs commented Jun 28, 2017

smarterclayton commented Jun 28, 2017 via email

gmarek commented Jun 29, 2017

shyamjvs commented Jun 29, 2017

smarterclayton commented Jun 29, 2017 via email

gmarek commented Jun 29, 2017

shyamjvs commented Jun 21, 2017 •

edited

Loading

shyamjvs commented Jun 21, 2017 •

edited

Loading