From: Kefeng Wang <wangkefeng.wang@huawei.com>
To: David Hildenbrand <david@redhat.com>,
	Andrew Morton <akpm@linux-foundation.org>
Cc: Oscar Salvador <osalvador@suse.de>,
	Miaohe Lin <linmiaohe@huawei.com>,
	Naoya Horiguchi <nao.horiguchi@gmail.com>, <linux-mm@kvack.org>
Subject: Re: [PATCH 2/4] mm: memory_hotplug: check hwpoisoned page firstly in do_migrate_range()
Date: Wed, 31 Jul 2024 13:09:25 +0800
Message-ID: <56b9a389-4c36-4892-962d-b45878f30f4d@huawei.com>
In-Reply-To: <b8d46144-ccb3-4f90-96bc-8818c69df471@redhat.com>



On 2024/7/30 18:26, David Hildenbrand wrote:
> On 25.07.24 03:16, Kefeng Wang wrote:
>> The commit b15c87263a69 ("hwpoison, memory_hotplug: allow hwpoisoned
>> pages to be offlined") don't handle the hugetlb pages, the dead loop
>> still occur if offline a hwpoison hugetlb, luckly, after the commit
>> e591ef7d96d6 ("mm,hwpoison,hugetlb,memory_hotplug: hotremove memory
>> section with hwpoisoned hugepage"), the HPageMigratable of hugetlb
>> page will be clear, and the hwpoison hugetlb page will be skipped in
>> scan_movable_pages(), so the deed loop issue is fixed.
> 
> did you mean "endless loop" ?

Exactly, I will fix the wording.

> 
>>
>> However if the HPageMigratable() check passed(without reference and
>> lock), the hugetlb page may be hwpoisoned, it won't cause issue since
>> the hwpoisoned page will be handled correctly in the next movable
>> pages scan loop, and it will be isolated in do_migrate_range() and
>> but fails to migrated. In order to avoid the unnecessary isolation and
>> unify all hwpoisoned page handling, let's unconditionally check hwpoison
>> firstly, and if it is a hwpoisoned hugetlb page, try to unmap it as
>> the catch all safety net like normal page does.
> 
> But what's the benefit here besides slightly faster handling in an 
> absolute corner case (I strongly suspect that we don't care)?

Yes, it is very much a corner case. The goal is to move isolate_hugetlb()
after the HWPoison check, and then to unify isolation and folio
conversion (patch 4). But we must correctly handle the hugetlb unmap
when we meet a hwpoisoned page.
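
To illustrate the ordering described above, here is a minimal userspace
sketch, not kernel code. It assumes a simplified model in which each page
carries only three flags; struct model_page, struct model_stats and
model_migrate_range() are hypothetical names invented for this example and
do not exist in mm/memory_hotplug.c. It only shows that checking for poison
first means a poisoned page is never isolated, while a still-mapped one gets
a best-effort unmap.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Userspace stand-in for a page; NOT the kernel's struct page. */
struct model_page {
	bool hwpoison;	/* models the result of PageHWPoison() */
	bool mapped;	/* models "still mapped into some process" */
	bool isolated;	/* set once the page is queued for migration */
};

struct model_stats {
	size_t unmapped;	/* poisoned pages that were still mapped */
	size_t isolated;	/* healthy pages queued for migration */
};

/* Hypothetical helper modelling the reordered loop body. */
static struct model_stats model_migrate_range(struct model_page *pages,
					      size_t n)
{
	struct model_stats st = { 0, 0 };

	assert(pages != NULL || n == 0);

	for (size_t i = 0; i < n; i++) {
		struct model_page *p = &pages[i];

		/* Poison check comes first: never isolate a poisoned page. */
		if (p->hwpoison) {
			if (p->mapped) {
				p->mapped = false;	/* best-effort unmap */
				st.unmapped++;
			}
			continue;
		}

		/* Only pages that passed the check are isolated. */
		p->isolated = true;
		st.isolated++;
	}

	return st;
}
```

In this model, a range holding two healthy pages and two poisoned pages
(one of them still mapped) ends with two pages isolated and one unmapped,
and neither poisoned page is ever put on the isolation list.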

> 
>>
>> Signed-off-by: Kefeng Wang <wangkefeng.wang@huawei.com>
>> ---
>>   mm/memory_hotplug.c | 27 ++++++++++++++++-----------
>>   1 file changed, 16 insertions(+), 11 deletions(-)
>>
>> diff --git a/mm/memory_hotplug.c b/mm/memory_hotplug.c
>> index 66267c26ca1b..ccaf4c480aed 100644
>> --- a/mm/memory_hotplug.c
>> +++ b/mm/memory_hotplug.c
>> @@ -1788,28 +1788,33 @@ static void do_migrate_range(unsigned long start_pfn, unsigned long end_pfn)
>>           folio = page_folio(page);
>>           head = &folio->page;
>> -        if (PageHuge(page)) {
>> -            pfn = page_to_pfn(head) + compound_nr(head) - 1;
>> -            isolate_hugetlb(folio, &source);
>> -            continue;
>> -        } else if (PageTransHuge(page))
>> -            pfn = page_to_pfn(head) + thp_nr_pages(page) - 1;
>> -
>>           /*
>>            * HWPoison pages have elevated reference counts so the migration would
>>            * fail on them. It also doesn't make any sense to migrate them in the
>>            * first place. Still try to unmap such a page in case it is still mapped
>> -         * (e.g. current hwpoison implementation doesn't unmap KSM pages but keep
>> -         * the unmap as the catch all safety net).
>> +         * (keep the unmap as the catch all safety net).
>>            */
>> -        if (PageHWPoison(page)) {
>> +        if (unlikely(PageHWPoison(page))) {
> 
> We're not checking the head page here, will this work reliably for 
> hugetlb? (I recall some difference in per-page hwpoison handling between 
> hugetlb and THP due to the vmemmap optimization)

Before this change, do_migrate_range() did not try to unmap a hwpoisoned
hugetlb page; we hoped it had already been unmapped in memory_failure().
As the comment mentions, that unmap may fail, so this adds a new
safeguard that tries to unmap it again here, but we don't need to
guarantee that it succeeds.

unmap_posioned_folio() is used to correctly handle hugetlb pages in
shared mappings when we meet a hwpoisoned page (which may be the head
page or a subpage).

> 
> 

