From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mail-wm0-f44.google.com (mail-wm0-f44.google.com [74.125.82.44]) by kanga.kvack.org (Postfix) with ESMTP id 957E26B0005 for ; Wed, 17 Feb 2016 10:01:30 -0500 (EST) Received: by mail-wm0-f44.google.com with SMTP id c200so218091774wme.0 for ; Wed, 17 Feb 2016 07:01:30 -0800 (PST) Received: from mail-wm0-f50.google.com (mail-wm0-f50.google.com. [74.125.82.50]) by mx.google.com with ESMTPS id ik8si714897wjb.229.2016.02.17.07.01.29 for (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Wed, 17 Feb 2016 07:01:29 -0800 (PST) Received: by mail-wm0-f50.google.com with SMTP id g62so164604474wme.1 for ; Wed, 17 Feb 2016 07:01:29 -0800 (PST) Date: Wed, 17 Feb 2016 16:01:27 +0100 From: Michal Hocko Subject: Re: [PATCH 2/6] mm,oom: don't abort on exiting processes when selecting a victim. Message-ID: <20160217150127.GR29196@dhcp22.suse.cz> References: <201602171928.GDE00540.SLJMOFFQOHtFVO@I-love.SAKURA.ne.jp> <201602171930.AII18204.FMOSVFQFOJtLOH@I-love.SAKURA.ne.jp> <20160217125418.GF29196@dhcp22.suse.cz> <201602172207.GAG52105.FOtMJOFQOVSFHL@I-love.SAKURA.ne.jp> <20160217140006.GM29196@dhcp22.suse.cz> <201602172339.JBJ57868.tSQVJLHMFFOOFO@I-love.SAKURA.ne.jp> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <201602172339.JBJ57868.tSQVJLHMFFOOFO@I-love.SAKURA.ne.jp> Sender: owner-linux-mm@kvack.org List-ID: To: Tetsuo Handa Cc: akpm@linux-foundation.org, rientjes@google.com, mgorman@suse.de, oleg@redhat.com, torvalds@linux-foundation.org, hughd@google.com, andrea@kernel.org, riel@redhat.com, linux-mm@kvack.org, linux-kernel@vger.kernel.org On Wed 17-02-16 23:39:47, Tetsuo Handa wrote: > Michal Hocko wrote: > > On Wed 17-02-16 22:07:31, Tetsuo Handa wrote: > > > Michal Hocko wrote: > > > > On Wed 17-02-16 19:30:41, Tetsuo Handa wrote: > > > > > >From 22bd036766e70f0df38c38f3ecc226e857d20faf Mon Sep 17 00:00:00 2001 > > > > > From: Tetsuo Handa > > > > > Date: Wed, 17 Feb 2016 16:30:59 +0900 > > > > > Subject: [PATCH 2/6] mm,oom: don't abort on exiting processes when selecting a victim. > > > > > > > > > > Currently, oom_scan_process_thread() returns OOM_SCAN_ABORT when there > > > > > is a thread which is exiting. But it is possible that that thread is > > > > > blocked at down_read(&mm->mmap_sem) in exit_mm() called from do_exit() > > > > > whereas one of threads sharing that memory is doing a GFP_KERNEL > > > > > allocation between down_write(&mm->mmap_sem) and up_write(&mm->mmap_sem) > > > > > (e.g. mmap()). Under such situation, the OOM killer does not choose a > > > > > victim, which results in silent OOM livelock problem. > > > > > > > > Again, such a thread/task will have fatal_signal_pending and so have > > > > access to memory reserves. So the text is slightly misleading imho. > > > > Sure if the memory reserves are depleted then we will not move on but > > > > then it is not clear whether the current patch helps either. > > > > > > I don't think so. > > > Please see http://lkml.kernel.org/r/201602151958.HCJ48972.FFOFOLMHSQVJtO@I-love.SAKURA.ne.jp . > > > > I have missed this one. Reading... > > > > Hmm, so you are not referring to OOM killed task but naturally exiting > > thread which is racing with the OOM killer. I guess you have a point > > there! Could you update the changelog with the above example and repost > > please? > > > Yes and I resent that patch as v2. > > I think that the same problem exists for any task_will_free_mem()-based > optimizations. Can we eliminate them because these optimized paths are not > handled by the OOM reaper which means that we have no means other than > "[PATCH 5/6] mm,oom: Re-enable OOM killer using timers." ? Well, only oom_kill_process usage of task_will_free_mem might be a problem because out_of_memory operates on the current task so it must be in the allocation path and access to memory reserves should help it to continue. Wrt. oom_kill_process this will be more tricky. I guess we want to teach oom_reaper to operate on such a task which would be a more robust solution than removing the check altogether. -- Michal Hocko SUSE Labs -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@kvack.org. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: email@kvack.org