
Reference implementation for sharded system #1064

Closed
erictune opened this issue Aug 27, 2014 · 7 comments
Labels
area/api Indicates an issue on api area. area/downward-api kind/design Categorizes issue or PR as related to design. kind/documentation Categorizes issue or PR as related to documentation. lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. priority/awaiting-more-evidence Lowest priority. Possibly useful, but not yet enough support to actually get it done. sig/network Categorizes an issue or PR as relevant to SIG Network.

Comments

@erictune (Member)

A system where data is sharded among several pods will be a common pattern (that is, each pod handles a different chunk or chunks of some data that is too large for any one pod). As mentioned in #1007, the preferred way to tell each pod to do this is to have a work queue or work lease system. The not-preferred way to do this is to generate a different config for each pod which tells it which shard to handle.

It would be good to have an example application that follows the preferred pattern, to show users how we think k8s should be used, and to confirm that it is as easy to do as we think it will be.

A good example could be a search application. Maybe someone could get Apache Solr running on K8s.
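A minimal sketch of the preferred pattern described above: pods claim shards from a shared work queue rather than each receiving a per-pod config naming its shard. The queue here is Python's in-process `queue.Queue` standing in for an external queue/lease service, and the shard IDs and worker loop are illustrative assumptions, not part of any Kubernetes API.

```python
import queue

def fill_queue(num_shards):
    """A coordinator enqueues one work item per shard."""
    q = queue.Queue()
    for shard_id in range(num_shards):
        q.put(shard_id)
    return q

def worker(q, results):
    """Each pod runs this loop: pull a shard ID, process it, repeat.
    No pod needs to be told in advance which shard is 'its' shard."""
    while True:
        try:
            shard_id = q.get_nowait()
        except queue.Empty:
            return
        results.append(f"processed shard {shard_id}")
        q.task_done()

q = fill_queue(4)
results = []
worker(q, results)  # in this single-process sketch, one "pod" drains the queue
print(results)
```

With a real queue service, running more pod replicas simply drains the queue faster; no pod-specific configuration changes are needed.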

@lavalamp (Member)

@brendandburns maybe we can turn the flake finder into this? :)

@smarterclayton (Contributor)

This is probably a dupe (or variation) of #260

@bgrant0607 bgrant0607 added sig/network Categorizes an issue or PR as relevant to SIG Network. area/downward-api area/api Indicates an issue on api area. kind/design Categorizes issue or PR as related to design. labels Oct 2, 2014
@bgrant0607 (Member)

Adding more details:

Sharding requires assignment of data to servers, typically at a finer granularity than pods, and some means for the client to find the server that holds the desired data item.

One could also replicate shards for availability or hot-spot mitigation, and/or cache them in multiple pods. Target affinity would likely be useful in the scenario where shard discovery is handled by the load balancer.

Sharding isn't necessarily incompatible with auto-scaling, though shard reassignment would be necessary.

Consistent hashing could potentially be used to assign shards, as could fine-grained locks or a work queue, as proposed above.
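The consistent-hashing option mentioned above can be sketched with a toy hash ring that maps shard keys to pods. The pod names, virtual-node count, and hash choice below are illustrative assumptions only; the point is that adding or removing a pod reassigns only a small fraction of the shards.

```python
import bisect
import hashlib

def _hash(key):
    # MD5 used only as a stable, well-spread hash for the sketch.
    return int(hashlib.md5(key.encode()).hexdigest(), 16)

class HashRing:
    def __init__(self, pods, vnodes=100):
        # Each pod contributes `vnodes` virtual nodes, sorted by hash,
        # to spread its share of the keyspace around the ring.
        self._ring = sorted(
            (_hash(f"{pod}#{i}"), pod) for pod in pods for i in range(vnodes)
        )
        self._keys = [h for h, _ in self._ring]

    def pod_for(self, shard_key):
        """Walk clockwise to the first virtual node at or after the key's hash."""
        idx = bisect.bisect(self._keys, _hash(shard_key)) % len(self._ring)
        return self._ring[idx][1]

ring = HashRing(["pod-0", "pod-1", "pod-2"])
owner = ring.pod_for("shard-17")
```

Lookups are deterministic, so any client with the pod list can compute the same shard-to-pod mapping without central coordination, which addresses the discovery problem raised above.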

@bgrant0607 bgrant0607 added this to the v0.9 milestone Oct 4, 2014
@bgrant0607 (Member)

While I generally advocate dynamic work allocation, for simple use cases a static shard assigner could map shards to the predictable hostnames generated by the cardinal service (#260).
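The static alternative above amounts to a fixed arithmetic mapping from shard ID to an ordinal hostname. A minimal sketch, assuming a hypothetical hostname pattern like `mysvc-{ordinal}` for the cardinal service's pods:

```python
def shard_host(shard_id, num_pods, pattern="mysvc-{ordinal}"):
    """Statically assign shard N to pod (N mod num_pods).
    The hostname pattern is an illustrative assumption, not a real API."""
    return pattern.format(ordinal=shard_id % num_pods)

# Shards 0..5 spread round-robin over three predictable hostnames.
assignments = {s: shard_host(s, 3) for s in range(6)}
```

No queue or coordinator is needed, at the cost of requiring manual reassignment whenever the pod count changes.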

@bgrant0607 bgrant0607 added status/closed/invalid priority/awaiting-more-evidence Lowest priority. Possibly useful, but not yet enough support to actually get it done. and removed status/closed/invalid labels Dec 3, 2014
@bgrant0607 bgrant0607 removed this from the v0.9 milestone Dec 3, 2014
@bgrant0607 bgrant0607 added the sig/api-machinery Categorizes an issue or PR as relevant to SIG API Machinery. label Feb 28, 2015
@bgrant0607 bgrant0607 added team/ux and removed sig/api-machinery Categorizes an issue or PR as relevant to SIG API Machinery. labels May 13, 2016
@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

Prevent issues from auto-closing with an /lifecycle frozen comment.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or @fejta.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Dec 21, 2017
@fejta-bot

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or @fejta.
/lifecycle rotten
/remove-lifecycle stale

@k8s-ci-robot k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Jan 20, 2018
b3atlesfan pushed a commit to b3atlesfan/kubernetes that referenced this issue Feb 5, 2021
7 participants