linux-mm.kvack.org archive mirror
 help / color / mirror / Atom feed
From: David Rientjes <rientjes@google.com>
To: Oleg Nesterov <oleg@redhat.com>
Cc: Andrew Morton <akpm@linux-foundation.org>,
	KAMEZAWA Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com>,
	KOSAKI Motohiro <kosaki.motohiro@jp.fujitsu.com>,
	Rik van Riel <riel@redhat.com>,
	linux-mm@kvack.org
Subject: Re: [patch 1/2] oom: avoid killing a task if a thread sharing its mm cannot be killed
Date: Mon, 16 Aug 2010 03:56:05 -0700 (PDT)	[thread overview]
Message-ID: <alpine.DEB.2.00.1008160350110.5305@chino.kir.corp.google.com> (raw)
In-Reply-To: <20100816055204.GA9498@redhat.com>

On Mon, 16 Aug 2010, Oleg Nesterov wrote:

> > There's no other way to detect threads in other thread groups that share
> > the same mm since subthreads of a process can have an oom_score_adj that
> > differ from that process, this includes the possibility of
> > OOM_SCORE_ADJ_MIN that we're interested in here.
> 
> Yes, you are right. Still, at least you can do
> 
> 	for_each_process(p) {
> 		if (p->mm != mm)
> 			continue;
> 		...
> 
> to quickly skip the thread group which doesn't share the same ->mm.
> 

Right, thanks.  I'll make that optimization and send out a second version 
of this series with the other changes you suggested.

> > > > -	if (p->signal->oom_score_adj == OOM_SCORE_ADJ_MIN) {
> > > > +	if (is_mm_unfreeable(p->mm)) {
> > >
> > > oom_badness() becomes O(n**2), not good.
> > >
> >
> > No, oom_badness() becomes O(n) from O(1); select_bad_process() becomes
> > slower for eligible tasks.
> 
> I meant, select_bad_process() becomes O(n^2). oom_badness() is O(n), yes.
> 

I'll follow my own suggestion for deferring this check to 
oom_kill_process() since it's certainly an unusual case if the tasks are 
sharing memory.  It'll require a second entire tasklist scan when it 
occurs, but definitely speeds up the common case.

> > > And, more importantly. This patch makes me think ->oom_score_adj should
> > > be moved from ->signal to ->mm.
> > >
> >
> > I did that several months ago but people were unhappy with how a parent's
> > oom_score_adj value would change if it did a vfork() and the child's
> > oom_score_adj value was changed prior to execve().
> 
> I see. But this patch in essence moves OOM_SCORE_ADJ_MIN from ->signal
> to ->mm (and btw personally I think this makes sense).
> 

Yes, and I still would have liked to embed it in struct mm_struct like I 
originally proposed, but I understand how some people didn't care much for 
the vfork() inheritance problem.  There are applications in the wild such 
as job schedulers that are OOM_DISABLE themselves and fork children and 
then reset their oom_adj value prior to exec.  So they do vfork() -> 
change child's oom_adj -> execve().  That currently works since the 
child's ->signal isn't shared (and before that, oom_adj was embedded in 
struct task_struct) and we can't change that behavior to also change the 
parent's oom_adj value at the same time because it shares an ->mm out from 
under them.

Thanks for reviewing the patches Oleg!

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

      reply	other threads:[~2010-08-16 10:56 UTC|newest]

Thread overview: 9+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2010-08-15  4:30 David Rientjes
2010-08-15  4:31 ` [patch 2/2] oom: kill all threads sharing oom killed task's mm David Rientjes
2010-08-15 15:45   ` Oleg Nesterov
2010-08-15 21:28     ` David Rientjes
2010-08-16  6:00       ` Oleg Nesterov
2010-08-15 15:18 ` [patch 1/2] oom: avoid killing a task if a thread sharing its mm cannot be killed Oleg Nesterov
2010-08-15 21:23   ` David Rientjes
2010-08-16  5:52     ` Oleg Nesterov
2010-08-16 10:56       ` David Rientjes [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=alpine.DEB.2.00.1008160350110.5305@chino.kir.corp.google.com \
    --to=rientjes@google.com \
    --cc=akpm@linux-foundation.org \
    --cc=kamezawa.hiroyu@jp.fujitsu.com \
    --cc=kosaki.motohiro@jp.fujitsu.com \
    --cc=linux-mm@kvack.org \
    --cc=oleg@redhat.com \
    --cc=riel@redhat.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox