Cap docker resource cgroup's limit #9881

dchen1107 · 2015-06-16T17:58:01Z

The only unrecovered node is caused by docker memory leakage. It is a known issue since docker 1.3.0, might even earlier version (moby/moby#9139). Docker 1.7.0-rcX (the one I am currently validating) should have a fix for it. Once you restart docker, the problem should be gone, and the node should be recovered. I saw a similar problem before. On each node, there is a monit healthchecking docker daemon process periodically, in most cases, the docker in such bad state will be restarted by monit.

Before we have such fix from docker 1.7, we can set docker's hard memory limit to 70% of node capacity since we already put docker into a cgroup with unlimited limit today. This could be a temporary workaround recovering node from bad state.

Related docker issue: moby/moby#9139

cc/ @lavalamp @bprashanth

dchen1107 · 2015-06-16T18:11:36Z

cc/ @vmarmol

dchen1107 added priority/important-soon Must be staffed and worked on either currently, or very soon, ideally in time for the next release. sig/node Categorizes an issue or PR as relevant to SIG Node. labels Jun 16, 2015

dchen1107 self-assigned this Jun 16, 2015

dchen1107 added this to the v1.0 milestone Jun 16, 2015

dchen1107 mentioned this issue Jun 16, 2015

monit is too aggressive in killing docker #9412

Closed

bprashanth mentioned this issue Jun 17, 2015

99%ile end-to-end pod startup time w/ prepulled images < 5s on 100 node, 3000 pod cluster; linear time to # nodes and pods #3954

Closed

dchen1107 mentioned this issue Jun 17, 2015

Configured resource-only container /docker-daemon with 70% of node me… #9961

Merged

saad-ali closed this as completed in #9961 Jun 17, 2015

jon-shanks mentioned this issue Aug 13, 2015

CoreOS 723.3.0 memoryinfo issue hence hits docker memory leak bug #12652

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cap docker resource cgroup's limit #9881

Cap docker resource cgroup's limit #9881

dchen1107 commented Jun 16, 2015

dchen1107 commented Jun 16, 2015

Cap docker resource cgroup's limit #9881

Cap docker resource cgroup's limit #9881

Comments

dchen1107 commented Jun 16, 2015

dchen1107 commented Jun 16, 2015