Re: [PATCH] mm: Avoid calling folio_page() with an out-of-bounds index

linux-mm.kvack.org archive mirror
 help / color / mirror / Atom feed

From: "Li Zhe" <lizhe.67@bytedance.com>
To: <david@kernel.org>
Cc: <Liam.Howlett@oracle.com>, <akpm@linux-foundation.org>,
	 <ankur.a.arora@oracle.com>, <linux-kernel@vger.kernel.org>,
	 <linux-mm@kvack.org>, <lizhe.67@bytedance.com>,
	 <lorenzo.stoakes@oracle.com>, <mhocko@suse.com>,
	<rppt@kernel.org>,  <surenb@google.com>, <vbabka@suse.cz>,
	<dev.jain@arm.com>
Subject: Re: [PATCH] mm: Avoid calling folio_page() with an out-of-bounds index
Date: Thu, 26 Feb 2026 11:14:04 +0800	[thread overview]
Message-ID: <20260226031404.82059-1-lizhe.67@bytedance.com> (raw)
In-Reply-To: <81065dd1-c131-48db-9ef6-01c0ec0001da@kernel.org>

On Wed, 25 Feb 2026 14:40:41 +0100 david@kernel.org wrote:

> On 2/25/26 10:26, Li Zhe wrote:
> > In folio_zero_user(), the page pointer is calculated via folio_page()
> > before checking if the number of pages to be cleared is greater than zero.
> > Furthermore, folio_page() does not verify that the page number lies
> > within folio.
> > 
> > When 'addr_hint' is near the end of a large folio, the range 'r[0]'
> > represents an empty interval. In this scenario, 'nr_pages' will be
> > calculated as 0 and 'r[0].start' can be an index that is out-of-bounds
> > for folio_page(). The code unconditionally calls folio_page() on a wrong
> > index, even though the subsequent clearing logic is correctly skipped.
> > 
> > While this does not cause a functional bug today, calculating a page
> > pointer for an out-of-bounds index is logically unsound and fragile. It
> > could pose a risk for future refactoring or trigger warnings from static
> > analysis tools.
> > 
> > To fix this, move the call to folio_page() inside the 'if (nr_pages > 0)'
> > block. This ensures that the page pointer is only calculated when it is
> > actually needed for a valid, non-empty range of pages, thus making the code
> > more robust and logically correct.
> > 
> > Signed-off-by: Li Zhe <lizhe.67@bytedance.com>
> > ---
> >  mm/memory.c | 8 +++++---
> >  1 file changed, 5 insertions(+), 3 deletions(-)
> > 
> > diff --git a/mm/memory.c b/mm/memory.c
> > index 07778814b4a8..6f8c55d604b5 100644
> > --- a/mm/memory.c
> > +++ b/mm/memory.c
> > @@ -7343,12 +7343,14 @@ void folio_zero_user(struct folio *folio, unsigned long addr_hint)
> >  	r[0] = DEFINE_RANGE(r[2].end + 1, pg.end);
> >  
> >  	for (i = 0; i < ARRAY_SIZE(r); i++) {
> > -		const unsigned long addr = base_addr + r[i].start * PAGE_SIZE;
> >  		const long nr_pages = (long)range_len(&r[i]);
> > -		struct page *page = folio_page(folio, r[i].start);
> >  
> > -		if (nr_pages > 0)
> > +		if (nr_pages > 0) {
> > +			const unsigned long addr = base_addr + r[i].start * PAGE_SIZE;
> > +			struct page *page = folio_page(folio, r[i].start);
> > +
> >  			clear_contig_highpages(page, addr, nr_pages);
> > +		}
> >  	}
> >  }
> >  
> 
> It's all just arithmetic operations. Just like Willy says, I'd assume
> that the compiler can just move them around as it pleases. No dependencies.
> 
> So I don't see any real benefit with this patch.

Yes, this change does not offer any performance benefits under the
current conditions. I proposed it primarily as a code cleanup effort.

> > While this does not cause a functional bug today, calculating a page
> > pointer for an out-of-bounds index is logically unsound and fragile. It
> > could pose a risk for future refactoring or trigger warnings from static
> > analysis tools.

This is what I see as the primary benefit of this patch. Should an
out-of-bounds issue surface later, it would still need to be addressed.

Thanks,
Zhe

     prev parent reply	other threads:[~2026-02-26  3:14 UTC|newest]

Thread overview: 7+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2026-02-25  9:26 Li Zhe
2026-02-25 10:05 ` Dev Jain
2026-02-25 13:31   ` Matthew Wilcox
2026-02-25 15:48     ` Dev Jain
2026-02-26  6:52     ` Ankur Arora
2026-02-25 13:40 ` David Hildenbrand (Arm)
2026-02-26  3:14   ` Li Zhe [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20260226031404.82059-1-lizhe.67@bytedance.com \
    --to=lizhe.67@bytedance.com \
    --cc=Liam.Howlett@oracle.com \
    --cc=akpm@linux-foundation.org \
    --cc=ankur.a.arora@oracle.com \
    --cc=david@kernel.org \
    --cc=dev.jain@arm.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-mm@kvack.org \
    --cc=lorenzo.stoakes@oracle.com \
    --cc=mhocko@suse.com \
    --cc=rppt@kernel.org \
    --cc=surenb@google.com \
    --cc=vbabka@suse.cz \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox