From: Michal Hocko <mhocko@suse.cz>
To: Tetsuo Handa <penguin-kernel@I-love.SAKURA.ne.jp>
Cc: linux-mm@kvack.org, rientjes@google.com, hannes@cmpxchg.org,
tj@kernel.org, akpm@linux-foundation.org,
linux-kernel@vger.kernel.org
Subject: Re: [RFC] panic_on_oom_timeout
Date: Thu, 11 Jun 2015 16:18:13 +0200 [thread overview]
Message-ID: <20150611141813.GA14088@dhcp22.suse.cz> (raw)
In-Reply-To: <201506112212.JAG26531.FLSVFMOQJOtOHF@I-love.SAKURA.ne.jp>
On Thu 11-06-15 22:12:40, Tetsuo Handa wrote:
> Michal Hocko wrote:
[...]
> > > The moom_work used by SysRq-f sometimes cannot be executed
> > > because some work which is processed before the moom_work is processed is
> > > stalled for unbounded amount of time due to looping inside the memory
> > > allocator.
> >
> > Wouldn't wq code pick up another worker thread to execute the work.
> > There is also a rescuer thread as the last resort AFAIR.
> >
>
> Below is an example of moom_work lockup in v4.1-rc7 from
> http://I-love.SAKURA.ne.jp/tmp/serial-20150611.txt.xz
>
> ----------
> [ 171.710406] sysrq: SysRq : Manual OOM execution
> [ 171.720193] kworker/2:9 invoked oom-killer: gfp_mask=0xd0, order=0, oom_score_adj=0
> [ 171.722699] kworker/2:9 cpuset=/ mems_allowed=0
> [ 171.724603] CPU: 2 PID: 11016 Comm: kworker/2:9 Not tainted 4.1.0-rc7 #3
> [ 171.726817] Hardware name: VMware, Inc. VMware Virtual Platform/440BX Desktop Reference Platform, BIOS 6.00 07/31/2013
> [ 171.729727] Workqueue: events moom_callback
> (...snipped...)
> [ 258.302016] sysrq: SysRq : Manual OOM execution
Wow, this is a _lot_. I was aware that workqueues might be overloaded.
We have seen that in real loads and that led to
http://marc.info/?l=linux-kernel&m=141456398425553 wher the rescuer
didn't handle pending work properly. I thought that the fix helped in
the end. But 1.5 minutes is indeed unexpected for me.
This of course disqualifies DELAYED_WORK for anything that has at least
reasonable time expectations which is the case here.
[...]
> I think that the basic rule for handling OOM condition is that "Do not trust
> anybody, for even the kswapd and rescuer threads can be waiting for lock or
> allocation. Decide when to give up at administrator's own risk, and choose
> another mm struct or call panic()".
Yes I agree with this and checking the timeout at select_bad_process
time sounds much more plausible in that regard.
Thanks!
--
Michal Hocko
SUSE Labs
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
next prev parent reply other threads:[~2015-06-11 14:18 UTC|newest]
Thread overview: 26+ messages / expand[flat|nested] mbox.gz Atom feed top
2015-06-09 17:03 Michal Hocko
2015-06-10 12:20 ` Tetsuo Handa
2015-06-10 14:28 ` Michal Hocko
2015-06-10 15:56 ` Michal Hocko
2015-06-12 15:23 ` Tetsuo Handa
2015-06-15 12:45 ` Michal Hocko
2015-06-16 13:14 ` Tetsuo Handa
2015-06-16 13:46 ` Michal Hocko
2015-06-17 12:16 ` Tetsuo Handa
2015-06-17 12:36 ` Michal Hocko
2015-06-11 13:12 ` Tetsuo Handa
2015-06-11 14:18 ` Michal Hocko [this message]
2015-06-11 14:45 ` Tetsuo Handa
2015-06-11 15:38 ` Michal Hocko
2015-06-17 12:11 ` [RFC -v2] panic_on_oom_timeout Michal Hocko
2015-06-17 12:31 ` Tetsuo Handa
2015-06-17 12:51 ` Michal Hocko
2015-06-17 13:24 ` Michal Hocko
2015-07-29 11:55 ` Michal Hocko
2015-07-29 13:20 ` Tetsuo Handa
2015-06-17 13:59 ` Tetsuo Handa
2015-06-17 15:41 ` Michal Hocko
2015-06-19 11:30 ` Tetsuo Handa
2015-06-19 15:36 ` Michal Hocko
2015-06-19 18:54 ` Tetsuo Handa
2015-06-20 7:57 ` Tetsuo Handa
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20150611141813.GA14088@dhcp22.suse.cz \
--to=mhocko@suse.cz \
--cc=akpm@linux-foundation.org \
--cc=hannes@cmpxchg.org \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=penguin-kernel@I-love.SAKURA.ne.jp \
--cc=rientjes@google.com \
--cc=tj@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox