From: Tetsuo Handa <penguin-kernel@I-love.SAKURA.ne.jp>
To: rientjes@google.com
Cc: mhocko@kernel.org, akpm@linux-foundation.org, mgorman@suse.de,
torvalds@linux-foundation.org, oleg@redhat.com, hughd@google.com,
andrea@kernel.org, riel@redhat.com, linux-mm@kvack.org,
linux-kernel@vger.kernel.org, mhocko@suse.com
Subject: Re: [PATCH] mm,oom: Re-enable OOM killer using timers.
Date: Wed, 13 Jan 2016 21:11:30 +0900 [thread overview]
Message-ID: <201601132111.GIG81705.LFOOHFOtQJSMVF@I-love.SAKURA.ne.jp> (raw)
In-Reply-To: <alpine.DEB.2.10.1601121717220.17063@chino.kir.corp.google.com>
David Rientjes wrote:
> I'm not sure why you are proposing adding both of these in the same patch;
> they have very different usecases and semantics.
Because both of these are for tuning the OOM killer.
>
> oomkiller_holdoff_ms, as indicated by the changelog, seems to be
> correcting some deficiency in the oom reaper.
It is not deficiency in the OOM reaper but deficiency in the OOM killer
or in the page allocator.
> I haven't reviewed that,
> but it seems like something that wouldn't need to be fixed with a
> timeout-based solution.
The problem is that it takes some amount of time to return memory to
freelist after memory was reclaimed. Unless we add a callback mechanism
for notifying that the memory used by TIF_MEMDIE task was reclaimed and
returned to freelist, there is no means to fix this problem.
> We either know if we have completed oom reaping
> or we haven't, it is something we should easily be able to figure out and
> not require heuristics such as this.
>
> This does not seem to have anything to do with current upstream code that
> does not have the oom reaper since the oom killer clearly has
> synchronization through oom_lock and we carefully defer for TIF_MEMDIE
> processes and abort for those that have not yet fully exited to free its
> memory. If patches are going to be proposed on top of the oom reaper,
> please explicitly state that.
The OOM reaper is irrelevant. The OOM reaper is merely an accelerator for
reclaiming memory earlier than now.
>
> I believe any such race described in the changelog could be corrected by
> deferring the oom killer entirely until the oom reaper has been able to
> free memory or the oom victim has fully exited. I haven't reviewed that,
> so I can't speak definitively, but I think we should avoid _any_ timeout
> based solution if possible and there's no indication this is the only way
> to solve such a problem.
The OOM killer can not know when the reclaimed memory is returned to freelist
(and therefore get_page_from_freelist() might succeed).
Currently timeout is the only way to mitigate this problem.
>
> oomkiller_victim_wait_ms seems to be another manifestation of the same
> patch which has been nack'd over and over again.
I believe the situation is changing due to introduction of the OOM reaper.
> It does not address the
> situation where there are no additional eligible processes to kill and we
> end up panicking the machine when additional access to memory reserves may
> have allowed the victim to exit. Randomly killing additional processes
> makes that problem worse since if they cannot exit (which may become more
> likely than not if all victims are waiting on a mutex held by an
> allocating thread).
>
> My solution for that has always been to grant allocating threads temporary
> access to memory reserves in the hope that the mutex be dropped and the
> victim may make forward progress. We have this implemented internally and
> I've posted a test module that easily exhibits the problem and how it is
> fixed.
Those who use panic_on_oom = 1 expect that the system triggers kernel panic
rather than stall forever. This is a translation of administrator's wish that
"Please press SysRq-c on behalf of me if the memory exhausted. In that way,
I don't need to stand by in front of the console twenty-four seven."
Those who use panic_on_oom = 0 expect that the OOM killer solves OOM condition
rather than stall forever. This is a translation of administrator's wish that
"Please press SysRq-f on behalf of me if the memory exhausted. In that way,
I don't need to stand by in front of the console twenty-four seven."
However, since the OOM killer never presses SysRq-f again until the OOM
victim terminates, this is annoying administrators.
Administrator: "I asked you to press SysRq-f on behalf of me. Why did you
let the system stalled forever?"
The OOM killer: "I did. The system did not recover from OOM condition."
Administrator: "Why you don't try pressing SysRq-f again on behalf of me?"
The OOM killer: "I am not programmed to do so."
Administrator: "You are really inattentive assistant. OK. Here is a patch
that programs you to press SysRq-f again on behalf of me."
What I want to say to the OOM killer is "Please don't toss the OOM killer's
duty away." when the OOM killer answered "I did something with a hope that
OOM condition is solved". And MM people are still NACKing administrator's
innocent wish.
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
next prev parent reply other threads:[~2016-01-13 12:11 UTC|newest]
Thread overview: 19+ messages / expand[flat|nested] mbox.gz Atom feed top
2016-01-07 11:26 Tetsuo Handa
2016-01-13 1:36 ` David Rientjes
2016-01-13 12:11 ` Tetsuo Handa [this message]
2016-01-13 16:26 ` Michal Hocko
2016-01-13 16:56 ` Johannes Weiner
2016-01-13 18:01 ` Michal Hocko
2016-01-14 11:26 ` Tetsuo Handa
2016-01-14 22:01 ` David Rientjes
2016-01-14 22:58 ` Johannes Weiner
2016-01-14 23:09 ` David Rientjes
2016-01-15 10:36 ` Tetsuo Handa
2016-01-19 23:13 ` David Rientjes
2016-01-20 14:36 ` Tetsuo Handa
2016-01-20 23:49 ` David Rientjes
2016-01-21 11:44 ` Tetsuo Handa
2016-01-21 23:15 ` David Rientjes
2016-01-22 13:59 ` Tetsuo Handa
2016-01-22 14:57 ` Johannes Weiner
2016-01-26 23:44 ` David Rientjes
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=201601132111.GIG81705.LFOOHFOtQJSMVF@I-love.SAKURA.ne.jp \
--to=penguin-kernel@i-love.sakura.ne.jp \
--cc=akpm@linux-foundation.org \
--cc=andrea@kernel.org \
--cc=hughd@google.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=mgorman@suse.de \
--cc=mhocko@kernel.org \
--cc=mhocko@suse.com \
--cc=oleg@redhat.com \
--cc=riel@redhat.com \
--cc=rientjes@google.com \
--cc=torvalds@linux-foundation.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox