From: lizhe.67@bytedance.com
To: david@redhat.com
Cc: alex.williamson@redhat.com, farman@linux.ibm.com, jgg@nvidia.com,
jgg@ziepe.ca, kvm@vger.kernel.org, linux-mm@kvack.org,
lizhe.67@bytedance.com, torvalds@linux-foundation.org
Subject: Re: [PATCH v5 1/5] mm: introduce num_pages_contiguous()
Date: Thu, 14 Aug 2025 15:58:27 +0800 [thread overview]
Message-ID: <20250814075827.62858-1-lizhe.67@bytedance.com> (raw)
In-Reply-To: <b426d3b9-c674-436e-95c3-fcc7647a044b@redhat.com>
On Thu, 14 Aug 2025 08:54:44 +0200, david@redhat.com wrote:
> On 14.08.25 08:47, lizhe.67@bytedance.com wrote:
> > From: Li Zhe <lizhe.67@bytedance.com>
> >
> > Let's add a simple helper for determining the number of contiguous pages
> > that represent contiguous PFNs.
> >
> > In an ideal world, this helper would be simpler or not even required.
> > Unfortunately, on some configs we still have to maintain (SPARSEMEM
> > without VMEMMAP), the memmap is allocated per memory section, and we might
> > run into weird corner cases of false positives when blindly testing for
> > contiguous pages only.
> >
> > One example of such false positives would be a memory section-sized hole
> > that does not have a memmap. The surrounding memory sections might get
> > "struct pages" that are contiguous, but the PFNs are actually not.
> >
> > This helper will, for example, be useful for determining contiguous PFNs
> > in a GUP result, to batch further operations across returned "struct
> > page"s. VFIO will utilize this interface to accelerate the VFIO DMA map
> > process.
> >
> > Implementation based on Linus' suggestions to avoid new usage of
> > nth_page() where avoidable.
> >
> > Suggested-by: Linus Torvalds <torvalds@linux-foundation.org>
> > Suggested-by: Jason Gunthorpe <jgg@ziepe.ca>
> > Signed-off-by: Li Zhe <lizhe.67@bytedance.com>
> > Co-developed-by: David Hildenbrand <david@redhat.com>
> > Signed-off-by: David Hildenbrand <david@redhat.com>
> > ---
> > include/linux/mm.h | 7 ++++++-
> > include/linux/mm_inline.h | 35 +++++++++++++++++++++++++++++++++++
> > 2 files changed, 41 insertions(+), 1 deletion(-)
> >
> > diff --git a/include/linux/mm.h b/include/linux/mm.h
> > index 1ae97a0b8ec7..ead6724972cf 100644
> > --- a/include/linux/mm.h
> > +++ b/include/linux/mm.h
> > @@ -1763,7 +1763,12 @@ static inline unsigned long page_to_section(const struct page *page)
> > {
> > return (page->flags >> SECTIONS_PGSHIFT) & SECTIONS_MASK;
> > }
> > -#endif
> > +#else /* !SECTION_IN_PAGE_FLAGS */
> > +static inline unsigned long page_to_section(const struct page *page)
> > +{
> > + return 0;
> > +}
> > +#endif /* SECTION_IN_PAGE_FLAGS */
> >
> > /**
> > * folio_pfn - Return the Page Frame Number of a folio.
> > diff --git a/include/linux/mm_inline.h b/include/linux/mm_inline.h
> > index 89b518ff097e..5ea23891fe4c 100644
> > --- a/include/linux/mm_inline.h
> > +++ b/include/linux/mm_inline.h
> > @@ -616,4 +616,39 @@ static inline bool vma_has_recency(struct vm_area_struct *vma)
> > return true;
> > }
> >
> > +/**
> > + * num_pages_contiguous() - determine the number of contiguous pages
> > + * that represent contiguous PFNs
> > + * @pages: an array of page pointers
> > + * @nr_pages: length of the array, at least 1
> > + *
> > + * Determine the number of contiguous pages that represent contiguous PFNs
> > + * in @pages, starting from the first page.
> > + *
> > + * In kernel configs where contiguous pages might not imply contiguous PFNs
> > + * over memory section boundaries, this function will stop at the memory
> > + * section boundary.
>
> Jason suggested here instead:
>
> "
> In some kernel configs contiguous PFNs will not have contiguous struct
> pages. In these configurations num_pages_contiguous() will return a
> smaller than ideal number. The caller should continue to check for pfn
> contiguity after each call to num_pages_contiguous().
> "
Thank you for the reminder! The comment here should be revised as
follows:
/**
* num_pages_contiguous() - determine the number of contiguous pages
* that represent contiguous PFNs
* @pages: an array of page pointers
* @nr_pages: length of the array, at least 1
*
* Determine the number of contiguous pages that represent contiguous PFNs
* in @pages, starting from the first page.
*
* In some kernel configs contiguous PFNs will not have contiguous struct
* pages. In these configurations num_pages_contiguous() will return a num
* smaller than ideal number. The caller should continue to check for pfn
* contiguity after each call to num_pages_contiguous().
*
* Returns the number of contiguous pages.
*/
Thanks,
Zhe
next prev parent reply other threads:[~2025-08-14 7:58 UTC|newest]
Thread overview: 14+ messages / expand[flat|nested] mbox.gz Atom feed top
2025-08-14 6:47 [PATCH v5 0/5] vfio/type1: optimize vfio_pin_pages_remote() and vfio_unpin_pages_remote() lizhe.67
2025-08-14 6:47 ` [PATCH v5 1/5] mm: introduce num_pages_contiguous() lizhe.67
2025-08-14 6:54 ` David Hildenbrand
2025-08-14 7:58 ` lizhe.67 [this message]
2025-08-27 18:10 ` Alex Williamson
2025-09-01 3:25 ` lizhe.67
2025-09-29 3:21 ` lizhe.67
2025-09-29 20:19 ` Alex Williamson
2025-09-30 3:36 ` lizhe.67
2025-08-14 6:47 ` [PATCH v5 2/5] vfio/type1: optimize vfio_pin_pages_remote() lizhe.67
2025-08-14 6:47 ` [PATCH v5 3/5] vfio/type1: batch vfio_find_vpfn() in function vfio_unpin_pages_remote() lizhe.67
2025-08-14 6:47 ` [PATCH v5 4/5] vfio/type1: introduce a new member has_rsvd for struct vfio_dma lizhe.67
2025-08-14 6:47 ` [PATCH v5 5/5] vfio/type1: optimize vfio_unpin_pages_remote() lizhe.67
2025-10-06 19:44 ` [PATCH v5 0/5] vfio/type1: optimize vfio_pin_pages_remote() and vfio_unpin_pages_remote() Alex Williamson
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20250814075827.62858-1-lizhe.67@bytedance.com \
--to=lizhe.67@bytedance.com \
--cc=alex.williamson@redhat.com \
--cc=david@redhat.com \
--cc=farman@linux.ibm.com \
--cc=jgg@nvidia.com \
--cc=jgg@ziepe.ca \
--cc=kvm@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=torvalds@linux-foundation.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox