* [PATCH v2 1/2] mm: khugepaged: convert set_huge_pmd() to take a folio
@ 2025-05-09 13:01 Baolin Wang
2025-05-09 13:01 ` [PATCH v2 2/2] mm: convert do_set_pmd() " Baolin Wang
2025-05-09 14:58 ` [PATCH v2 1/2] mm: khugepaged: convert set_huge_pmd() " David Hildenbrand
0 siblings, 2 replies; 5+ messages in thread
From: Baolin Wang @ 2025-05-09 13:01 UTC (permalink / raw)
To: akpm, willy, david
Cc: hannes, lorenzo.stoakes, Liam.Howlett, npache, ryan.roberts,
dev.jain, ziy, vbabka, rppt, surenb, mhocko, baolin.wang,
linux-mm, linux-kernel
We've already gotten the stable locked folio in collapse_pte_mapped_thp(),
so just use folio for set_huge_pmd() to set the PMD entry, which is more
straightforward.
Moreover, we will check the folio size in do_set_pmd(), so we can remove
the unnecessary VM_BUG_ON() in set_huge_pmd(). While we are at it, we can
also remove the PageTransHuge(), as it currently has no callers.
Signed-off-by: Baolin Wang <baolin.wang@linux.alibaba.com>
---
Changes from v1:
- Remove the unnecessary VM_BUG_ON().
- Remove the PageTransHuge().
---
include/linux/page-flags.h | 15 ---------------
mm/khugepaged.c | 9 ++++-----
2 files changed, 4 insertions(+), 20 deletions(-)
diff --git a/include/linux/page-flags.h b/include/linux/page-flags.h
index 37b11f15dbd9..1c1d49554c71 100644
--- a/include/linux/page-flags.h
+++ b/include/linux/page-flags.h
@@ -907,20 +907,6 @@ FOLIO_FLAG_FALSE(partially_mapped)
#define PG_head_mask ((1UL << PG_head))
#ifdef CONFIG_TRANSPARENT_HUGEPAGE
-/*
- * PageHuge() only returns true for hugetlbfs pages, but not for
- * normal or transparent huge pages.
- *
- * PageTransHuge() returns true for both transparent huge and
- * hugetlbfs pages, but not normal pages. PageTransHuge() can only be
- * called only in the core VM paths where hugetlbfs pages can't exist.
- */
-static inline int PageTransHuge(const struct page *page)
-{
- VM_BUG_ON_PAGE(PageTail(page), page);
- return PageHead(page);
-}
-
/*
* PageTransCompound returns true for both transparent huge pages
* and hugetlbfs pages, so it should only be called when it's known
@@ -931,7 +917,6 @@ static inline int PageTransCompound(const struct page *page)
return PageCompound(page);
}
#else
-TESTPAGEFLAG_FALSE(TransHuge, transhuge)
TESTPAGEFLAG_FALSE(TransCompound, transcompound)
#endif
diff --git a/mm/khugepaged.c b/mm/khugepaged.c
index b04b6a770afe..aca66e7f4fd9 100644
--- a/mm/khugepaged.c
+++ b/mm/khugepaged.c
@@ -1467,7 +1467,7 @@ static void collect_mm_slot(struct khugepaged_mm_slot *mm_slot)
#ifdef CONFIG_SHMEM
/* hpage must be locked, and mmap_lock must be held */
static int set_huge_pmd(struct vm_area_struct *vma, unsigned long addr,
- pmd_t *pmdp, struct page *hpage)
+ pmd_t *pmdp, struct folio *folio)
{
struct vm_fault vmf = {
.vma = vma,
@@ -1476,13 +1476,12 @@ static int set_huge_pmd(struct vm_area_struct *vma, unsigned long addr,
.pmd = pmdp,
};
- VM_BUG_ON(!PageTransHuge(hpage));
mmap_assert_locked(vma->vm_mm);
- if (do_set_pmd(&vmf, hpage))
+ if (do_set_pmd(&vmf, &folio->page))
return SCAN_FAIL;
- get_page(hpage);
+ folio_get(folio);
return SCAN_SUCCEED;
}
@@ -1689,7 +1688,7 @@ int collapse_pte_mapped_thp(struct mm_struct *mm, unsigned long addr,
maybe_install_pmd:
/* step 5: install pmd entry */
result = install_pmd
- ? set_huge_pmd(vma, haddr, pmd, &folio->page)
+ ? set_huge_pmd(vma, haddr, pmd, folio)
: SCAN_SUCCEED;
goto drop_folio;
abort:
--
2.43.5
^ permalink raw reply [flat|nested] 5+ messages in thread
* [PATCH v2 2/2] mm: convert do_set_pmd() to take a folio
2025-05-09 13:01 [PATCH v2 1/2] mm: khugepaged: convert set_huge_pmd() to take a folio Baolin Wang
@ 2025-05-09 13:01 ` Baolin Wang
2025-05-09 15:01 ` Zi Yan
2025-05-09 14:58 ` [PATCH v2 1/2] mm: khugepaged: convert set_huge_pmd() " David Hildenbrand
1 sibling, 1 reply; 5+ messages in thread
From: Baolin Wang @ 2025-05-09 13:01 UTC (permalink / raw)
To: akpm, willy, david
Cc: hannes, lorenzo.stoakes, Liam.Howlett, npache, ryan.roberts,
dev.jain, ziy, vbabka, rppt, surenb, mhocko, baolin.wang,
linux-mm, linux-kernel
In do_set_pmd(), we always use the folio->page to build PMD mappings for
the entire folio. Since all callers of do_set_pmd() already hold a stable
folio, converting do_set_pmd() to take a folio is safe and more straightforward.
In addition, to ensure the extensibility of do_set_pmd() for supporting
larger folios beyond PMD size, we keep the 'page' parameter to specify
which page within the folio should be mapped.
No functional changes expected.
Signed-off-by: Baolin Wang <baolin.wang@linux.alibaba.com>
---
Changes from v1:
- Keep the 'page' parameter of the do_set_pmd().
Note: I did mm selftests and built kernel on tmpfs/xfs filesystems, and
did not find any regression.
---
include/linux/mm.h | 2 +-
mm/filemap.c | 2 +-
mm/khugepaged.c | 2 +-
mm/memory.c | 11 +++++------
4 files changed, 8 insertions(+), 9 deletions(-)
diff --git a/include/linux/mm.h b/include/linux/mm.h
index 43748c8f3454..d5f578c91e77 100644
--- a/include/linux/mm.h
+++ b/include/linux/mm.h
@@ -1237,7 +1237,7 @@ static inline pte_t maybe_mkwrite(pte_t pte, struct vm_area_struct *vma)
return pte;
}
-vm_fault_t do_set_pmd(struct vm_fault *vmf, struct page *page);
+vm_fault_t do_set_pmd(struct vm_fault *vmf, struct folio *folio, struct page *page);
void set_pte_range(struct vm_fault *vmf, struct folio *folio,
struct page *page, unsigned int nr, unsigned long addr);
diff --git a/mm/filemap.c b/mm/filemap.c
index 7b90cbeb4a1a..09d005848f0d 100644
--- a/mm/filemap.c
+++ b/mm/filemap.c
@@ -3533,7 +3533,7 @@ static bool filemap_map_pmd(struct vm_fault *vmf, struct folio *folio,
if (pmd_none(*vmf->pmd) && folio_test_pmd_mappable(folio)) {
struct page *page = folio_file_page(folio, start);
- vm_fault_t ret = do_set_pmd(vmf, page);
+ vm_fault_t ret = do_set_pmd(vmf, folio, page);
if (!ret) {
/* The page is mapped successfully, reference consumed. */
folio_unlock(folio);
diff --git a/mm/khugepaged.c b/mm/khugepaged.c
index aca66e7f4fd9..c4b5031f3254 100644
--- a/mm/khugepaged.c
+++ b/mm/khugepaged.c
@@ -1478,7 +1478,7 @@ static int set_huge_pmd(struct vm_area_struct *vma, unsigned long addr,
mmap_assert_locked(vma->vm_mm);
- if (do_set_pmd(&vmf, &folio->page))
+ if (do_set_pmd(&vmf, folio, &folio->page))
return SCAN_FAIL;
folio_get(folio);
diff --git a/mm/memory.c b/mm/memory.c
index 68c1d962d0ad..9c202c32ca66 100644
--- a/mm/memory.c
+++ b/mm/memory.c
@@ -5176,9 +5176,8 @@ static void deposit_prealloc_pte(struct vm_fault *vmf)
vmf->prealloc_pte = NULL;
}
-vm_fault_t do_set_pmd(struct vm_fault *vmf, struct page *page)
+vm_fault_t do_set_pmd(struct vm_fault *vmf, struct folio *folio, struct page *page)
{
- struct folio *folio = page_folio(page);
struct vm_area_struct *vma = vmf->vma;
bool write = vmf->flags & FAULT_FLAG_WRITE;
unsigned long haddr = vmf->address & HPAGE_PMD_MASK;
@@ -5251,7 +5250,7 @@ vm_fault_t do_set_pmd(struct vm_fault *vmf, struct page *page)
return ret;
}
#else
-vm_fault_t do_set_pmd(struct vm_fault *vmf, struct page *page)
+vm_fault_t do_set_pmd(struct vm_fault *vmf, struct folio *folio, struct page *page)
{
return VM_FAULT_FALLBACK;
}
@@ -5345,6 +5344,7 @@ vm_fault_t finish_fault(struct vm_fault *vmf)
else
page = vmf->page;
+ folio = page_folio(page);
/*
* check even for read faults because we might have lost our CoWed
* page
@@ -5356,8 +5356,8 @@ vm_fault_t finish_fault(struct vm_fault *vmf)
}
if (pmd_none(*vmf->pmd)) {
- if (PageTransCompound(page)) {
- ret = do_set_pmd(vmf, page);
+ if (folio_test_pmd_mappable(folio)) {
+ ret = do_set_pmd(vmf, folio, page);
if (ret != VM_FAULT_FALLBACK)
return ret;
}
@@ -5368,7 +5368,6 @@ vm_fault_t finish_fault(struct vm_fault *vmf)
return VM_FAULT_OOM;
}
- folio = page_folio(page);
nr_pages = folio_nr_pages(folio);
/*
--
2.43.5
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH v2 1/2] mm: khugepaged: convert set_huge_pmd() to take a folio
2025-05-09 13:01 [PATCH v2 1/2] mm: khugepaged: convert set_huge_pmd() to take a folio Baolin Wang
2025-05-09 13:01 ` [PATCH v2 2/2] mm: convert do_set_pmd() " Baolin Wang
@ 2025-05-09 14:58 ` David Hildenbrand
2025-05-12 2:17 ` Baolin Wang
1 sibling, 1 reply; 5+ messages in thread
From: David Hildenbrand @ 2025-05-09 14:58 UTC (permalink / raw)
To: Baolin Wang, akpm, willy
Cc: hannes, lorenzo.stoakes, Liam.Howlett, npache, ryan.roberts,
dev.jain, ziy, vbabka, rppt, surenb, mhocko, linux-mm,
linux-kernel
> diff --git a/mm/khugepaged.c b/mm/khugepaged.c
> index b04b6a770afe..aca66e7f4fd9 100644
> --- a/mm/khugepaged.c
> +++ b/mm/khugepaged.c
> @@ -1467,7 +1467,7 @@ static void collect_mm_slot(struct khugepaged_mm_slot *mm_slot)
> #ifdef CONFIG_SHMEM
> /* hpage must be locked, and mmap_lock must be held */
^ that comment probably needs some love.
> static int set_huge_pmd(struct vm_area_struct *vma, unsigned long addr,
> - pmd_t *pmdp, struct page *hpage)
> + pmd_t *pmdp, struct folio *folio)
> {
> struct vm_fault vmf = {
> .vma = vma,
> @@ -1476,13 +1476,12 @@ static int set_huge_pmd(struct vm_area_struct *vma, unsigned long addr,
> .pmd = pmdp,
> };
>
> - VM_BUG_ON(!PageTransHuge(hpage));
> mmap_assert_locked(vma->vm_mm);
>
> - if (do_set_pmd(&vmf, hpage))
> + if (do_set_pmd(&vmf, &folio->page))
> return SCAN_FAIL;
>
> - get_page(hpage);
> + folio_get(folio);
> return SCAN_SUCCEED;
> }
>
> @@ -1689,7 +1688,7 @@ int collapse_pte_mapped_thp(struct mm_struct *mm, unsigned long addr,
> maybe_install_pmd:
> /* step 5: install pmd entry */
> result = install_pmd
> - ? set_huge_pmd(vma, haddr, pmd, &folio->page)
> + ? set_huge_pmd(vma, haddr, pmd, folio)
Wondering why we are not passing in the folio+page pair in here as well.
I assume in the foreseeable future this code will not be able to work
with folios large than PMDs?
Apart from that LGTM.
Acked-by: David Hildenbrand <david@redhat.com>
--
Cheers,
David / dhildenb
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH v2 2/2] mm: convert do_set_pmd() to take a folio
2025-05-09 13:01 ` [PATCH v2 2/2] mm: convert do_set_pmd() " Baolin Wang
@ 2025-05-09 15:01 ` Zi Yan
0 siblings, 0 replies; 5+ messages in thread
From: Zi Yan @ 2025-05-09 15:01 UTC (permalink / raw)
To: Baolin Wang
Cc: akpm, willy, david, hannes, lorenzo.stoakes, Liam.Howlett,
npache, ryan.roberts, dev.jain, vbabka, rppt, surenb, mhocko,
linux-mm, linux-kernel
On 9 May 2025, at 9:01, Baolin Wang wrote:
> In do_set_pmd(), we always use the folio->page to build PMD mappings for
> the entire folio. Since all callers of do_set_pmd() already hold a stable
> folio, converting do_set_pmd() to take a folio is safe and more straightforward.
>
> In addition, to ensure the extensibility of do_set_pmd() for supporting
> larger folios beyond PMD size, we keep the 'page' parameter to specify
> which page within the folio should be mapped.
>
> No functional changes expected.
>
> Signed-off-by: Baolin Wang <baolin.wang@linux.alibaba.com>
> ---
> Changes from v1:
> - Keep the 'page' parameter of the do_set_pmd().
>
> Note: I did mm selftests and built kernel on tmpfs/xfs filesystems, and
> did not find any regression.
> ---
> include/linux/mm.h | 2 +-
> mm/filemap.c | 2 +-
> mm/khugepaged.c | 2 +-
> mm/memory.c | 11 +++++------
> 4 files changed, 8 insertions(+), 9 deletions(-)
>
LGTM. Reviewed-by: Zi Yan <ziy@nvidia.com>
--
Best Regards,
Yan, Zi
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH v2 1/2] mm: khugepaged: convert set_huge_pmd() to take a folio
2025-05-09 14:58 ` [PATCH v2 1/2] mm: khugepaged: convert set_huge_pmd() " David Hildenbrand
@ 2025-05-12 2:17 ` Baolin Wang
0 siblings, 0 replies; 5+ messages in thread
From: Baolin Wang @ 2025-05-12 2:17 UTC (permalink / raw)
To: David Hildenbrand, akpm, willy
Cc: hannes, lorenzo.stoakes, Liam.Howlett, npache, ryan.roberts,
dev.jain, ziy, vbabka, rppt, surenb, mhocko, linux-mm,
linux-kernel
On 2025/5/9 22:58, David Hildenbrand wrote:
>> diff --git a/mm/khugepaged.c b/mm/khugepaged.c
>> index b04b6a770afe..aca66e7f4fd9 100644
>> --- a/mm/khugepaged.c
>> +++ b/mm/khugepaged.c
>> @@ -1467,7 +1467,7 @@ static void collect_mm_slot(struct
>> khugepaged_mm_slot *mm_slot)
>> #ifdef CONFIG_SHMEM
>> /* hpage must be locked, and mmap_lock must be held */
>
> ^ that comment probably needs some love.
Ah, missed that. Will update the comments in next version.
>
>> static int set_huge_pmd(struct vm_area_struct *vma, unsigned long addr,
>> - pmd_t *pmdp, struct page *hpage)
>> + pmd_t *pmdp, struct folio *folio)
>> {
>> struct vm_fault vmf = {
>> .vma = vma,
>> @@ -1476,13 +1476,12 @@ static int set_huge_pmd(struct vm_area_struct
>> *vma, unsigned long addr,
>> .pmd = pmdp,
>> };
>> - VM_BUG_ON(!PageTransHuge(hpage));
>> mmap_assert_locked(vma->vm_mm);
>> - if (do_set_pmd(&vmf, hpage))
>> + if (do_set_pmd(&vmf, &folio->page))
>> return SCAN_FAIL;
>> - get_page(hpage);
>> + folio_get(folio);
>> return SCAN_SUCCEED;
>> }
>> @@ -1689,7 +1688,7 @@ int collapse_pte_mapped_thp(struct mm_struct
>> *mm, unsigned long addr,
>> maybe_install_pmd:
>> /* step 5: install pmd entry */
>> result = install_pmd
>> - ? set_huge_pmd(vma, haddr, pmd, &folio->page)
>> + ? set_huge_pmd(vma, haddr, pmd, folio)
>
> Wondering why we are not passing in the folio+page pair in here as well.
> I assume in the foreseeable future this code will not be able to work
> with folios large than PMDs?
OK. Will do in next version.
> Apart from that LGTM.
>
> Acked-by: David Hildenbrand <david@redhat.com>
Thanks.
^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2025-05-12 2:17 UTC | newest]
Thread overview: 5+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2025-05-09 13:01 [PATCH v2 1/2] mm: khugepaged: convert set_huge_pmd() to take a folio Baolin Wang
2025-05-09 13:01 ` [PATCH v2 2/2] mm: convert do_set_pmd() " Baolin Wang
2025-05-09 15:01 ` Zi Yan
2025-05-09 14:58 ` [PATCH v2 1/2] mm: khugepaged: convert set_huge_pmd() " David Hildenbrand
2025-05-12 2:17 ` Baolin Wang
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox