From: "Steven J. Magnani" <steve@digidescorp.com>
To: David Howells <dhowells@redhat.com>
Cc: linux-mm@kvack.org, linux-kernel@vger.kernel.org,
kamezawa.hiroyu@jp.fujitsu.com
Subject: Re: [PATCH][RESEND] nommu: add anonymous page memcg accounting
Date: Fri, 01 Oct 2010 11:41:07 -0500 [thread overview]
Message-ID: <1285951267.2558.69.camel@iscandar.digidescorp.com> (raw)
In-Reply-To: <5867.1285945621@redhat.com>
On Fri, 2010-10-01 at 16:07 +0100, David Howells wrote:
> Steve Magnani <steve@digidescorp.com> wrote:
>
> > If anything I think nommu is one of the better applications of memcg. Since
> > nommu typically embedded, being able to put potential memory pigs in a
> > sandbox so they can't destabilize the system is a Good Thing. That was my
> > motivation for doing this in the first place and it works quite well.
>
> I suspect it's not useful for a few reasons:
>
> (1) You don't normally run many applications on a NOMMU system. Typically,
> you'll run just one, probably threaded app, I think.
Not always.
>
> (2) In general, you won't be able to cull processes to make space. If the OOM
> killer runs your application has a bug in it.
Not always. Every now and then applications have to deal with
user-supplied input of some sort.
In our case it's a user-formatted disk drive that can have some
arbitrarily-sized FAT32 partition on which we are required to run
dosfsck. Now, dosfsck is the epitome of a memory pig; its memory
requirements scale with partition size, number of dentries, and any
damage encountered - none of which can be predicted. There is a set of
partitions we are able to check with no problem, but no guarantee the
user won't present us with one that would bring down the whole system,
were the OOM killer to get involved. Putting just dosfsck in its own
sandbox ensures this can't happen. See also my response to #4 below.
>
> (3) memcg has a huge overhead. 20 bytes per page! On a 4K page 32-bit
> system, that's nearly 5% of your RAM, assuming I understand the
> CGROUP_MEM_RES_CTLR config help text correctly.
When you use 16K pages, 20 bytes/page isn't so huge :)
>
> (4) There's no swapping, no page faults, no migration and little shareable
> memory. Being able to allocate large blocks of contiguous memory is much
> more important and much more of a bottleneck than this. The 5% of RAM
> lost makes that just that little bit harder.
>
> If it's memory sandboxing you require, ulimit might be sufficient for NOMMU
> mode.
dosfsck is written to handle memory allocation failures properly
(bailing out) but I have not been able to get this code to execute when
the system runs out of memory - the OOM killer gets invoked and that's
all she wrote. Will a ulimit violation return control back to the
process, or terminate it in some graceful manner?
>
> However, I suppose there's little harm in letting the patch in. I would guess
> the additions all optimise away if memcg isn't enabled.
>
> A question for you: why does struct page_cgroup need a page pointer? If an
> array of page_cgroup structs is allocated per array of page structs, then you
> should be able to use the array index to map between them.
Kame is probably better able to answer this.
Steve
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
next prev parent reply other threads:[~2010-10-01 16:41 UTC|newest]
Thread overview: 9+ messages / expand[flat|nested] mbox.gz Atom feed top
2010-10-01 10:35 Steven J. Magnani
2010-10-01 14:24 ` David Howells
2010-10-01 14:31 ` Steve Magnani
2010-10-03 18:10 ` Balbir Singh
2010-10-01 15:07 ` David Howells
2010-10-01 16:41 ` Steven J. Magnani [this message]
2010-10-03 18:17 ` Balbir Singh
2010-10-04 0:16 ` KAMEZAWA Hiroyuki
2010-10-03 18:08 ` Balbir Singh
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1285951267.2558.69.camel@iscandar.digidescorp.com \
--to=steve@digidescorp.com \
--cc=dhowells@redhat.com \
--cc=kamezawa.hiroyu@jp.fujitsu.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox