From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mail-ed1-f69.google.com (mail-ed1-f69.google.com [209.85.208.69]) by kanga.kvack.org (Postfix) with ESMTP id 6AA8B6B0005 for ; Mon, 6 Aug 2018 07:09:37 -0400 (EDT) Received: by mail-ed1-f69.google.com with SMTP id i26-v6so4103650edr.4 for ; Mon, 06 Aug 2018 04:09:37 -0700 (PDT) Received: from mx1.suse.de (mx2.suse.de. [195.135.220.15]) by mx.google.com with ESMTPS id r26-v6si8718004edm.42.2018.08.06.04.09.36 for (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Mon, 06 Aug 2018 04:09:36 -0700 (PDT) Date: Mon, 6 Aug 2018 13:09:34 +0200 From: Michal Hocko Subject: Re: WARNING in try_charge Message-ID: <20180806110934.GJ19540@dhcp22.suse.cz> References: <0000000000005e979605729c1564@google.com> <20180806091552.GE19540@dhcp22.suse.cz> <20180806094827.GH19540@dhcp22.suse.cz> <884de816-671a-44d4-a6a1-2ad7eff53715@I-love.SAKURA.ne.jp> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <884de816-671a-44d4-a6a1-2ad7eff53715@I-love.SAKURA.ne.jp> Sender: owner-linux-mm@kvack.org List-ID: To: Tetsuo Handa Cc: Dmitry Vyukov , syzbot , cgroups@vger.kernel.org, Johannes Weiner , LKML , Linux-MM , syzkaller-bugs , Vladimir Davydov On Mon 06-08-18 19:47:00, Tetsuo Handa wrote: > On 2018/08/06 19:39, Dmitry Vyukov wrote: > > On Mon, Aug 6, 2018 at 11:48 AM, Michal Hocko wrote: > >> Btw. running with the above diff on top might help us to ideantify > >> whether this is a pre-mature warning or a valid one. Still useful to > >> find out. > > Since syzbot already found a syz reproducer, you can ask syzbot to test it. > > > > > The bug report has a reproducer, so you can run it with the patch. Or > > ask syzbot to test your patch: > > https://github.com/google/syzkaller/blob/master/docs/syzbot.md#testing-patches > > Which basically boils down to saying: > > > > #syz test: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git > > master > > Excuse me, but this is linux-next only problem. Therefore, If this really is a linux-next only problem then please retest with the current linux-next which has dropped the and replaced the group oom code. > #syz test: git://git.kernel.org/pub/scm/linux/kernel/git/next/linux-next.git master > > diff --git a/mm/memcontrol.c b/mm/memcontrol.c > index 4603ad75c9a9..852cd3dbdcd9 100644 > --- a/mm/memcontrol.c > +++ b/mm/memcontrol.c > @@ -1388,6 +1388,8 @@ static bool mem_cgroup_out_of_memory(struct mem_cgroup *memcg, gfp_t gfp_mask, > bool ret; > > mutex_lock(&oom_lock); > + pr_info("task=%s pid=%d invoked memcg oom killer. oom_victim=%d\n", > + current->comm, current->pid, tsk_is_oom_victim(current)); > ret = out_of_memory(&oc); > mutex_unlock(&oom_lock); > return ret; > > F.Y.I. Waiting until __mmput() completes (with timeout using OOM score feedback) > ( https://syzkaller.appspot.com/x/patch.diff?x=101e449c400000 ) solves this race. Which just means that something else is broken. Seriously, timout is not going to fix anything. It merely changes the picture. -- Michal Hocko SUSE Labs