From: Michal Hocko <mhocko@kernel.org>
To: Tetsuo Handa <penguin-kernel@I-love.SAKURA.ne.jp>
Cc: akpm@linux-foundation.org, rientjes@google.com, linux-mm@kvack.org
Subject: Re: [PATCH 2/2] mm,oom: Do oom_task_origin() test in oom_badness().
Date: Mon, 23 May 2016 13:21:41 +0200 [thread overview]
Message-ID: <20160523112141.GO2278@dhcp22.suse.cz> (raw)
In-Reply-To: <201605231959.IHB04619.OtLFOJSQHFVMFO@I-love.SAKURA.ne.jp>
On Mon 23-05-16 19:59:30, Tetsuo Handa wrote:
> Michal Hocko wrote:
> > On Sat 21-05-16 11:01:30, Tetsuo Handa wrote:
> > > Currently, oom_scan_process_thread() returns OOM_SCAN_SELECT if
> > > oom_task_origin() returned true. But this might cause OOM livelock.
> > >
> > > If the OOM killer finds a task with oom_task_origin(task) == true,
> > > it means that that task is either inside try_to_unuse() from swapoff
> > > path or unmerge_and_remove_all_rmap_items() from ksm's run_store path.
> > >
> > > Let's take a look at try_to_unuse() as an example. Although there is
> > > signal_pending() test inside the iteration loop, there are operations
> > > (e.g. mmput(), wait_on_page_*()) which might block in unkillable state
> > > waiting for other threads which might allocate memory.
> > >
> > > Therefore, sending SIGKILL to a task with oom_task_origin(task) == true
> > > can not guarantee that that task shall not stuck at unkillable waits.
> > > Once the OOM reaper reaped that task's memory (or gave up reaping it),
> > > the OOM killer must not select that task again when oom_task_origin(task)
> > > returned true. We need to select different victims until that task can
> > > hit signal_pending() test or finish the iteration loop.
> > >
> > > Since oom_badness() is a function which returns score of the given thread
> > > group with eligibility/livelock test, it is more natural and safer to let
> > > oom_badness() return highest score when oom_task_origin(task) == true.
> > >
> > > This patch moves oom_task_origin() test from oom_scan_process_thread() to
> > > after MMF_OOM_REAPED test inside oom_badness(), changes the callers to
> > > receive the score using "unsigned long" variable, and eliminates
> > > OOM_SCAN_SELECT path in the callers.
> >
> > I do not think this is the right approach. If the problem is real then
> > the patch just papers over deficiency of the oom_task_origin which
> > should be addressed instead.
> >
> > > Signed-off-by: Tetsuo Handa <penguin-kernel@I-love.SAKURA.ne.jp>
> >
> > Nacked-by: Michal Hocko <mhocko@suse.com>
> >
>
> Quoting from http://lkml.kernel.org/r/20160523075524.GG2278@dhcp22.suse.cz :
> > > Is it guaranteed that try_to_unuse() from swapoff is never blocked on memory
> > > allocation (e.g. mmput(), wait_on_page_*()) ?
> >
> > It shouldn't. All the waiting should be killable. If not it is a bug and
> > should be fixed.
>
> So, you think that we should replace mmput() with mmput_async(), lock_page()
> with lock_page_killable(), wait_on_page_bit() with wait_on_page_bit_killable(),
> mutex_lock() with mutex_lock_killable(), down_read() with down_read_killable()
> and so on, don't you?
Yes where appropriate. And even more importantly. First think whether
you are trying to address a real problem. It doesn't make any sense to
complicate the code for something that is simply not realistic. In this
particular case we are talking about _root_ doing a potentially
expensive operation and I am pretty sure no reasonable admin would do it
under a high memory pressure. So all the cases you are trying to address
are maybe completely unrealistic.
--
Michal Hocko
SUSE Labs
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
prev parent reply other threads:[~2016-05-23 11:21 UTC|newest]
Thread overview: 4+ messages / expand[flat|nested] mbox.gz Atom feed top
2016-05-21 2:01 Tetsuo Handa
2016-05-23 8:19 ` Michal Hocko
2016-05-23 10:59 ` Tetsuo Handa
2016-05-23 11:21 ` Michal Hocko [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20160523112141.GO2278@dhcp22.suse.cz \
--to=mhocko@kernel.org \
--cc=akpm@linux-foundation.org \
--cc=linux-mm@kvack.org \
--cc=penguin-kernel@I-love.SAKURA.ne.jp \
--cc=rientjes@google.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox