From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id B8374C433F5 for ; Sat, 26 Feb 2022 01:20:25 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id 180328D0002; Fri, 25 Feb 2022 20:20:25 -0500 (EST) Received: by kanga.kvack.org (Postfix, from userid 40) id 1080E8D0001; Fri, 25 Feb 2022 20:20:25 -0500 (EST) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id F11688D0002; Fri, 25 Feb 2022 20:20:24 -0500 (EST) X-Delivered-To: linux-mm@kvack.org Received: from forelay.hostedemail.com (smtprelay0239.hostedemail.com [216.40.44.239]) by kanga.kvack.org (Postfix) with ESMTP id DE0A28D0001 for ; Fri, 25 Feb 2022 20:20:24 -0500 (EST) Received: from smtpin26.hostedemail.com (10.5.19.251.rfc1918.com [10.5.19.251]) by forelay01.hostedemail.com (Postfix) with ESMTP id 9F948181CB2C0 for ; Sat, 26 Feb 2022 01:20:24 +0000 (UTC) X-FDA: 79183175568.26.CE5A8D7 Received: from ams.source.kernel.org (ams.source.kernel.org [145.40.68.75]) by imf16.hostedemail.com (Postfix) with ESMTP id 0CBAF180003 for ; Sat, 26 Feb 2022 01:20:23 +0000 (UTC) Received: from smtp.kernel.org (relay.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ams.source.kernel.org (Postfix) with ESMTPS id D6897B833BE; Sat, 26 Feb 2022 01:20:22 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPSA id 2C31EC340E7; Sat, 26 Feb 2022 01:20:21 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=linux-foundation.org; s=korg; t=1645838421; bh=MNZX72o3nFMbB/Wwvsuo/JcaX7p+1wDXXsOIDM8WRLU=; h=Date:From:To:Subject:In-Reply-To:References:From; b=zmWl8r1zxiQ0jrS8v8GX2dMS2IbwAeJT3rJc4OKcvYDvfgriT/2Uh+pJMTxjwkmga dec2ncVjw75/YQsfaJ41+6Vkm0UlnLtet/OAZkUCcbZEBLuq/+WccgxsVMq89VFcHB 3RaCOTxB6U4AZBiA/00lgniy6doBCdZfikWpFUHY= Date: Fri, 25 Feb 2022 17:20:20 -0800 From: Andrew Morton To: Shakeel Butt , =?ISO-8859-1?Q?"Michal_Koutn=FD"?= , Johannes Weiner , Michal Hocko , Roman Gushchin , Ivan Babrou , cgroups@vger.kernel.org, linux-mm@kvack.org, linux-kernel@vger.kernel.org, Daniel Dao , stable@vger.kernel.org Subject: Re: [PATCH] memcg: async flush memcg stats from perf sensitive codepaths Message-Id: <20220225172020.b3e59e11a0a3dd15e0d34141@linux-foundation.org> In-Reply-To: <20220225165842.561d3a475310aeab86a2d653@linux-foundation.org> References: <20220226002412.113819-1-shakeelb@google.com> <20220225165842.561d3a475310aeab86a2d653@linux-foundation.org> X-Mailer: Sylpheed 3.7.0 (GTK+ 2.24.33; x86_64-redhat-linux-gnu) Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit X-Rspamd-Queue-Id: 0CBAF180003 X-Stat-Signature: 5che4ok3aoz5ajzyh5mj3dfcsjqinrgz X-Rspam-User: Authentication-Results: imf16.hostedemail.com; dkim=pass header.d=linux-foundation.org header.s=korg header.b=zmWl8r1z; dmarc=none; spf=pass (imf16.hostedemail.com: domain of akpm@linux-foundation.org designates 145.40.68.75 as permitted sender) smtp.mailfrom=akpm@linux-foundation.org X-Rspamd-Server: rspam03 X-HE-Tag: 1645838423-396097 X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: On Fri, 25 Feb 2022 16:58:42 -0800 Andrew Morton wrote: > On Fri, 25 Feb 2022 16:24:12 -0800 Shakeel Butt wrote: > > > Daniel Dao has reported [1] a regression on workloads that may trigger > > a lot of refaults (anon and file). The underlying issue is that flushing > > rstat is expensive. Although rstat flush are batched with (nr_cpus * > > MEMCG_BATCH) stat updates, it seems like there are workloads which > > genuinely do stat updates larger than batch value within short amount of > > time. Since the rstat flush can happen in the performance critical > > codepaths like page faults, such workload can suffer greatly. > > > > The easiest fix for now is for performance critical codepaths trigger > > the rstat flush asynchronously. This patch converts the refault codepath > > to use async rstat flush. In addition, this patch has premptively > > converted mem_cgroup_wb_stats and shrink_node to also use the async > > rstat flush as they may also similar performance regressions. > > Gee we do this trick a lot and gee I don't like it :( > > a) if we're doing too much work then we're doing too much work. > Punting that work over to a different CPU or thread doesn't alter > that - it in fact adds more work. > > b) there's an assumption here that the flusher is able to keep up > with the producer. What happens if that isn't the case? Do we > simply wind up the deferred items until the system goes oom? > > What happens if there's a producer running on each CPU? Can the > flushers keep up? > > Pathologically, what happens if the producer is running > task_is_realtime() on a single-CPU system? Or if there's a > task_is_realtime() producer running on every CPU? The flusher never > gets to run and we're dead? Not some theoretical thing, btw. See how __read_swap_cache_async() just got its sins exposed by real-time tasks: https://lkml.kernel.org/r/20220221111749.1928222-1-cgel.zte@gmail.com