From: David Rientjes <rientjes@google.com>
To: Oleg Nesterov <oleg@redhat.com>
Cc: Andrew Morton <akpm@linux-foundation.org>,
KAMEZAWA Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com>,
KOSAKI Motohiro <kosaki.motohiro@jp.fujitsu.com>,
Rik van Riel <riel@redhat.com>,
linux-mm@kvack.org
Subject: Re: [patch 1/2] oom: avoid killing a task if a thread sharing its mm cannot be killed
Date: Mon, 16 Aug 2010 03:56:05 -0700 (PDT) [thread overview]
Message-ID: <alpine.DEB.2.00.1008160350110.5305@chino.kir.corp.google.com> (raw)
In-Reply-To: <20100816055204.GA9498@redhat.com>
On Mon, 16 Aug 2010, Oleg Nesterov wrote:
> > There's no other way to detect threads in other thread groups that share
> > the same mm since subthreads of a process can have an oom_score_adj that
> > differ from that process, this includes the possibility of
> > OOM_SCORE_ADJ_MIN that we're interested in here.
>
> Yes, you are right. Still, at least you can do
>
> for_each_process(p) {
> if (p->mm != mm)
> continue;
> ...
>
> to quickly skip the thread group which doesn't share the same ->mm.
>
Right, thanks. I'll make that optimization and send out a second version
of this series with the other changes you suggested.
> > > > - if (p->signal->oom_score_adj == OOM_SCORE_ADJ_MIN) {
> > > > + if (is_mm_unfreeable(p->mm)) {
> > >
> > > oom_badness() becomes O(n**2), not good.
> > >
> >
> > No, oom_badness() becomes O(n) from O(1); select_bad_process() becomes
> > slower for eligible tasks.
>
> I meant, select_bad_process() becomes O(n^2). oom_badness() is O(n), yes.
>
I'll follow my own suggestion for deferring this check to
oom_kill_process() since it's certainly an unusual case if the tasks are
sharing memory. It'll require a second entire tasklist scan when it
occurs, but definitely speeds up the common case.
> > > And, more importantly. This patch makes me think ->oom_score_adj should
> > > be moved from ->signal to ->mm.
> > >
> >
> > I did that several months ago but people were unhappy with how a parent's
> > oom_score_adj value would change if it did a vfork() and the child's
> > oom_score_adj value was changed prior to execve().
>
> I see. But this patch in essence moves OOM_SCORE_ADJ_MIN from ->signal
> to ->mm (and btw personally I think this makes sense).
>
Yes, and I still would have liked to embed it in struct mm_struct like I
originally proposed, but I understand how some people didn't care much for
the vfork() inheritance problem. There are applications in the wild such
as job schedulers that are OOM_DISABLE themselves and fork children and
then reset their oom_adj value prior to exec. So they do vfork() ->
change child's oom_adj -> execve(). That currently works since the
child's ->signal isn't shared (and before that, oom_adj was embedded in
struct task_struct) and we can't change that behavior to also change the
parent's oom_adj value at the same time because it shares an ->mm out from
under them.
Thanks for reviewing the patches Oleg!
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
prev parent reply other threads:[~2010-08-16 10:56 UTC|newest]
Thread overview: 9+ messages / expand[flat|nested] mbox.gz Atom feed top
2010-08-15 4:30 David Rientjes
2010-08-15 4:31 ` [patch 2/2] oom: kill all threads sharing oom killed task's mm David Rientjes
2010-08-15 15:45 ` Oleg Nesterov
2010-08-15 21:28 ` David Rientjes
2010-08-16 6:00 ` Oleg Nesterov
2010-08-15 15:18 ` [patch 1/2] oom: avoid killing a task if a thread sharing its mm cannot be killed Oleg Nesterov
2010-08-15 21:23 ` David Rientjes
2010-08-16 5:52 ` Oleg Nesterov
2010-08-16 10:56 ` David Rientjes [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=alpine.DEB.2.00.1008160350110.5305@chino.kir.corp.google.com \
--to=rientjes@google.com \
--cc=akpm@linux-foundation.org \
--cc=kamezawa.hiroyu@jp.fujitsu.com \
--cc=kosaki.motohiro@jp.fujitsu.com \
--cc=linux-mm@kvack.org \
--cc=oleg@redhat.com \
--cc=riel@redhat.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox