From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id BA5B8C433F5 for ; Wed, 9 Mar 2022 12:32:50 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id 38AEE8D0002; Wed, 9 Mar 2022 07:32:50 -0500 (EST) Received: by kanga.kvack.org (Postfix, from userid 40) id 3391A8D0001; Wed, 9 Mar 2022 07:32:50 -0500 (EST) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id 228988D0002; Wed, 9 Mar 2022 07:32:50 -0500 (EST) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (relay.a.hostedemail.com [64.99.140.24]) by kanga.kvack.org (Postfix) with ESMTP id 14E498D0001 for ; Wed, 9 Mar 2022 07:32:50 -0500 (EST) Received: from smtpin04.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay13.hostedemail.com (Postfix) with ESMTP id CD010615FC for ; Wed, 9 Mar 2022 12:32:49 +0000 (UTC) X-FDA: 79224786858.04.2BF814D Received: from outbound-smtp44.blacknight.com (outbound-smtp44.blacknight.com [46.22.136.52]) by imf19.hostedemail.com (Postfix) with ESMTP id B450B1A0013 for ; Wed, 9 Mar 2022 12:32:48 +0000 (UTC) Received: from mail.blacknight.com (pemlinmail04.blacknight.ie [81.17.254.17]) by outbound-smtp44.blacknight.com (Postfix) with ESMTPS id 382FBF8058 for ; Wed, 9 Mar 2022 12:32:47 +0000 (GMT) Received: (qmail 26046 invoked from network); 9 Mar 2022 12:32:47 -0000 Received: from unknown (HELO techsingularity.net) (mgorman@techsingularity.net@[84.203.17.223]) by 81.17.254.9 with ESMTPSA (AES256-SHA encrypted, authenticated); 9 Mar 2022 12:32:46 -0000 Date: Wed, 9 Mar 2022 12:32:45 +0000 From: Mel Gorman To: Eric Dumazet Cc: Eric Dumazet , Andrew Morton , linux-kernel , linux-mm , Vlastimil Babka , Michal Hocko , Shakeel Butt , Wei Xu , Greg Thelen , Hugh Dickins , David Rientjes Subject: Re: [PATCH v2] mm/page_alloc: call check_new_pages() while zone spinlock is not held Message-ID: <20220309123245.GI15701@techsingularity.net> References: <20220304170215.1868106-1-eric.dumazet@gmail.com> <20220307091541.GD15701@techsingularity.net> MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-15 Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.10.1 (2018-07-13) X-Rspamd-Queue-Id: B450B1A0013 X-Stat-Signature: 4j6pbg5guf44r3qrmmzccd48k3y31kap X-Rspam-User: Authentication-Results: imf19.hostedemail.com; dkim=none; spf=pass (imf19.hostedemail.com: domain of mgorman@techsingularity.net designates 46.22.136.52 as permitted sender) smtp.mailfrom=mgorman@techsingularity.net; dmarc=none X-Rspamd-Server: rspam07 X-HE-Tag: 1646829168-286683 X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: On Tue, Mar 08, 2022 at 03:49:48PM -0800, Eric Dumazet wrote: > On Mon, Mar 7, 2022 at 1:15 AM Mel Gorman wrote: > > > > On Fri, Mar 04, 2022 at 09:02:15AM -0800, Eric Dumazet wrote: > > > From: Eric Dumazet > > > > > > For high order pages not using pcp, rmqueue() is currently calling > > > the costly check_new_pages() while zone spinlock is held, > > > and hard irqs masked. > > > > > > This is not needed, we can release the spinlock sooner to reduce > > > zone spinlock contention. > > > > > > Note that after this patch, we call __mod_zone_freepage_state() > > > before deciding to leak the page because it is in bad state. > > > > > > v2: We need to keep interrupts disabled to call __mod_zone_freepage_state() > > > > > > Signed-off-by: Eric Dumazet > > > Cc: Mel Gorman > > > Cc: Vlastimil Babka > > > Cc: Michal Hocko > > > Cc: Shakeel Butt > > > Cc: Wei Xu > > > Cc: Greg Thelen > > > Cc: Hugh Dickins > > > Cc: David Rientjes > > > > Ok, this is only more expensive in the event pages on the free list have > > been corrupted whch is already very unlikely so thanks! > > > > Acked-by: Mel Gorman > > > > One remaining question is: > > After your patch ("mm/page_alloc: allow high-order pages to be stored > on the per-cpu lists"), > do we want to change check_pcp_refill()/check_new_pcp() to check all pages, > and not only the head ? > We should because it was an oversight. Thanks for pointing that out. > Or was it a conscious choice of yours ? > (I presume part of the performance gains came from > not having to bring ~7 cache lines per 32KB chunk on x86) > There will be a performance penalty due to the check but it's a correctness vs performance issue. This? It's boot tested only. --8<-- mm/page_alloc: check high-order pages for corruption during PCP operations Eric Dumazet pointed out that commit 44042b449872 ("mm/page_alloc: allow high-order pages to be stored on the per-cpu lists") only checks the head page during PCP refill and allocation operations. This was an oversight and all pages should be checked. This will incur a small performance penalty but it's necessary for correctness. Fixes: 44042b449872 ("mm/page_alloc: allow high-order pages to be stored on the per-cpu lists") Reported-by: Eric Dumazet Signed-off-by: Mel Gorman --- mm/page_alloc.c | 46 +++++++++++++++++++++++----------------------- 1 file changed, 23 insertions(+), 23 deletions(-) diff --git a/mm/page_alloc.c b/mm/page_alloc.c index 3589febc6d31..2920344fa887 100644 --- a/mm/page_alloc.c +++ b/mm/page_alloc.c @@ -2342,23 +2342,36 @@ static inline int check_new_page(struct page *page) return 1; } +static bool check_new_pages(struct page *page, unsigned int order) +{ + int i; + for (i = 0; i < (1 << order); i++) { + struct page *p = page + i; + + if (unlikely(check_new_page(p))) + return true; + } + + return false; +} + #ifdef CONFIG_DEBUG_VM /* * With DEBUG_VM enabled, order-0 pages are checked for expected state when * being allocated from pcp lists. With debug_pagealloc also enabled, they are * also checked when pcp lists are refilled from the free lists. */ -static inline bool check_pcp_refill(struct page *page) +static inline bool check_pcp_refill(struct page *page, unsigned int order) { if (debug_pagealloc_enabled_static()) - return check_new_page(page); + return check_new_pages(page, order); else return false; } -static inline bool check_new_pcp(struct page *page) +static inline bool check_new_pcp(struct page *page, unsigned int order) { - return check_new_page(page); + return check_new_pages(page, order); } #else /* @@ -2366,32 +2379,19 @@ static inline bool check_new_pcp(struct page *page) * when pcp lists are being refilled from the free lists. With debug_pagealloc * enabled, they are also checked when being allocated from the pcp lists. */ -static inline bool check_pcp_refill(struct page *page) +static inline bool check_pcp_refill(struct page *page, unsigned int order) { - return check_new_page(page); + return check_new_pages(page, order); } -static inline bool check_new_pcp(struct page *page) +static inline bool check_new_pcp(struct page *page, unsigned int order) { if (debug_pagealloc_enabled_static()) - return check_new_page(page); + return check_new_pages(page, order); else return false; } #endif /* CONFIG_DEBUG_VM */ -static bool check_new_pages(struct page *page, unsigned int order) -{ - int i; - for (i = 0; i < (1 << order); i++) { - struct page *p = page + i; - - if (unlikely(check_new_page(p))) - return true; - } - - return false; -} - inline void post_alloc_hook(struct page *page, unsigned int order, gfp_t gfp_flags) { @@ -3037,7 +3037,7 @@ static int rmqueue_bulk(struct zone *zone, unsigned int order, if (unlikely(page == NULL)) break; - if (unlikely(check_pcp_refill(page))) + if (unlikely(check_pcp_refill(page, order))) continue; /* @@ -3641,7 +3641,7 @@ struct page *__rmqueue_pcplist(struct zone *zone, unsigned int order, page = list_first_entry(list, struct page, lru); list_del(&page->lru); pcp->count -= 1 << order; - } while (check_new_pcp(page)); + } while (check_new_pcp(page, order)); return page; }