linux-mm.kvack.org archive mirror
 help / color / mirror / Atom feed
* [PATCH] mm/page_alloc: Fix freeing of failed-split poisoned compound pages
@ 2026-01-13 20:54 Boudewijn van der Heide
  2026-01-13 21:05 ` Zi Yan
  0 siblings, 1 reply; 3+ messages in thread
From: Boudewijn van der Heide @ 2026-01-13 20:54 UTC (permalink / raw)
  To: Andrew Morton
  Cc: Vlastimil Babka, Suren Baghdasaryan, Michal Hocko,
	Brendan Jackman, Johannes Weiner, Zi Yan, Naoya Horiguchi,
	Oscar Salvador, linux-mm, linux-kernel, boudewijn

free_pages_prepare() only handles poisoned order-0 pages.
In memory_failure() (hard offline), pages
are poisoned before attempting to split huge pages. If the split fails,
the page remains a compound (order > 0) but is already poisoned. However,
Soft-offline pages are always poisoned as order-0 after migration, so
they are unaffected.

The '!order' check causes these poisoned compound pages to skip
poison handling, leaving them in the buddy allocator.

Worst case, a poisoned compound page could be reallocated,
potentially leading to crashes, silent data corruption,
or unwanted memory containment actions before the poison bit is detected.

This patch removes the '&& !order' restriction. Cleanup functions in the
poison-handling block correctly handle non-zero order pages, making
this change safe.

Fixes: 79f5f8fab482 ("mm,hwpoison: rework soft offline for in-use pages")
Signed-off-by: Boudewijn van der Heide <boudewijn@delta-utec.com>
---
 mm/page_alloc.c | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/mm/page_alloc.c b/mm/page_alloc.c
index c380f063e8b7..64d15e56706c 100644
--- a/mm/page_alloc.c
+++ b/mm/page_alloc.c
@@ -1344,7 +1344,7 @@ __always_inline bool free_pages_prepare(struct page *page,
 		count_vm_events(UNEVICTABLE_PGCLEARED, nr_pages);
 	}
 
-	if (unlikely(PageHWPoison(page)) && !order) {
+	if (unlikely(PageHWPoison(page))) {
 		/* Do not let hwpoison pages hit pcplists/buddy */
 		reset_page_owner(page, order);
 		page_table_check_free(page, order);
-- 
2.47.3



^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: [PATCH] mm/page_alloc: Fix freeing of failed-split poisoned compound pages
  2026-01-13 20:54 [PATCH] mm/page_alloc: Fix freeing of failed-split poisoned compound pages Boudewijn van der Heide
@ 2026-01-13 21:05 ` Zi Yan
  2026-01-14 14:48   ` Boudewijn van der Heide
  0 siblings, 1 reply; 3+ messages in thread
From: Zi Yan @ 2026-01-13 21:05 UTC (permalink / raw)
  To: Boudewijn van der Heide
  Cc: Andrew Morton, Vlastimil Babka, Suren Baghdasaryan, Michal Hocko,
	Brendan Jackman, Johannes Weiner, Naoya Horiguchi,
	Oscar Salvador, linux-mm, linux-kernel, Miaohe Lin

Add Miaohe (memory failure maintainer)

On 13 Jan 2026, at 15:54, Boudewijn van der Heide wrote:

> free_pages_prepare() only handles poisoned order-0 pages.
> In memory_failure() (hard offline), pages
> are poisoned before attempting to split huge pages. If the split fails,
> the page remains a compound (order > 0) but is already poisoned. However,
> Soft-offline pages are always poisoned as order-0 after migration, so
> they are unaffected.
>
> The '!order' check causes these poisoned compound pages to skip
> poison handling, leaving them in the buddy allocator.
>
> Worst case, a poisoned compound page could be reallocated,
> potentially leading to crashes, silent data corruption,
> or unwanted memory containment actions before the poison bit is detected.
>
> This patch removes the '&& !order' restriction. Cleanup functions in the
> poison-handling block correctly handle non-zero order pages, making
> this change safe.

This is not a fix. IIUC, for >0 order free pages, memory failure uses
take_page_off_buddy() in a different code path.

Miaohe (cc’d) should be able to elaborate more on it.

>
> Fixes: 79f5f8fab482 ("mm,hwpoison: rework soft offline for in-use pages")
> Signed-off-by: Boudewijn van der Heide <boudewijn@delta-utec.com>
> ---
>  mm/page_alloc.c | 2 +-
>  1 file changed, 1 insertion(+), 1 deletion(-)
>
> diff --git a/mm/page_alloc.c b/mm/page_alloc.c
> index c380f063e8b7..64d15e56706c 100644
> --- a/mm/page_alloc.c
> +++ b/mm/page_alloc.c
> @@ -1344,7 +1344,7 @@ __always_inline bool free_pages_prepare(struct page *page,
>  		count_vm_events(UNEVICTABLE_PGCLEARED, nr_pages);
>  	}
>
> -	if (unlikely(PageHWPoison(page)) && !order) {
> +	if (unlikely(PageHWPoison(page))) {
>  		/* Do not let hwpoison pages hit pcplists/buddy */
>  		reset_page_owner(page, order);
>  		page_table_check_free(page, order);
> -- 
> 2.47.3


Best Regards,
Yan, Zi


^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: [PATCH] mm/page_alloc: Fix freeing of failed-split poisoned compound pages
  2026-01-13 21:05 ` Zi Yan
@ 2026-01-14 14:48   ` Boudewijn van der Heide
  0 siblings, 0 replies; 3+ messages in thread
From: Boudewijn van der Heide @ 2026-01-14 14:48 UTC (permalink / raw)
  To: ziy
  Cc: akpm, boudewijn, hannes, jackmanb, linmiaohe, linux-kernel,
	linux-mm, mhocko, nao.horiguchi, osalvador, surenb, vbabka

> > free_pages_prepare() only handles poisoned order-0 pages.
> > In memory_failure() (hard offline), pages
> > are poisoned before attempting to split huge pages. If the split fails,
> > the page remains a compound (order > 0) but is already poisoned. However,
> > Soft-offline pages are always poisoned as order-0 after migration, so
> > they are unaffected.
> >
> > The '!order' check causes these poisoned compound pages to skip
> > poison handling, leaving them in the buddy allocator.
> >
> > Worst case, a poisoned compound page could be reallocated,
> > potentially leading to crashes, silent data corruption,
> > or unwanted memory containment actions before the poison bit is detected.
> >
> > This patch removes the '&& !order' restriction. Cleanup functions in the
> > poison-handling block correctly handle non-zero order pages, making
> > this change safe.

> This is not a fix. IIUC, for >0 order free pages, memory failure uses
> take_page_off_buddy() in a different code path.
>

Thanks again for the quick response and clarification! 
From my understanding, 
you correctly noted that take_page_off_buddy() handles already-free pages, 
removing them from the buddy lists and setting SetPageHWPoisonTakenOff(). 
This prevents those pages from re-entering the buddy allocator.

My concern is about in-use THP-backed compound pages:
1. A compound page is in use. 
2. memory_failure() marks it poisoned (TestSetPageHWPoison).
3. try_to_split_thp_page() fails.
4. The process using the THP may be killed; 
        the page remains compound and poisoned.
5. Later, when the page is finally freed, it reaches free_pages_prepare();
        'take_page_off_buddy()' is not invoked in this path.

At this point, the current check:

'if (unlikely(PageHWPoison(page)) && !order)'

will not trigger, because the order > 0.

> Miaohe (cc’d) should be able to elaborate more on it.
Thanks for Cc'ing Miaohe, hopefully Miaohe can provide some more insights!

Thanks,
Boudewijn


^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2026-01-14 14:48 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2026-01-13 20:54 [PATCH] mm/page_alloc: Fix freeing of failed-split poisoned compound pages Boudewijn van der Heide
2026-01-13 21:05 ` Zi Yan
2026-01-14 14:48   ` Boudewijn van der Heide

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox