linux-mm.kvack.org archive mirror
 help / color / mirror / Atom feed
From: David Hildenbrand <david@redhat.com>
To: Kefeng Wang <wangkefeng.wang@huawei.com>,
	Andrew Morton <akpm@linux-foundation.org>
Cc: Oscar Salvador <osalvador@suse.de>,
	Miaohe Lin <linmiaohe@huawei.com>,
	Naoya Horiguchi <nao.horiguchi@gmail.com>,
	linux-mm@kvack.org
Subject: Re: [PATCH 2/4] mm: memory_hotplug: check hwpoisoned page firstly in do_migrate_range()
Date: Tue, 6 Aug 2024 11:24:18 +0200	[thread overview]
Message-ID: <cbdc4aea-3b1b-4286-aab2-10d01b192080@redhat.com> (raw)
In-Reply-To: <4e693aa6-d742-4fe7-bd97-3d375f96fcfa@huawei.com>

On 06.08.24 05:44, Kefeng Wang wrote:
> Hi David, I have some question,
> 
> On 2024/8/2 16:02, Kefeng Wang wrote:
>>
> ...
>>>>             */
>>>> -        if (PageHWPoison(page)) {
>>>> +        if (unlikely(PageHWPoison(page))) {
>>>> +            folio = page_folio(page);
>>>> +
>>>>                if (WARN_ON(folio_test_lru(folio)))
>>>>                    folio_isolate_lru(folio);
>>>> +
>>>>                if (folio_mapped(folio))
>>>> -                try_to_unmap(folio, TTU_IGNORE_MLOCK);
>>>> +                unmap_posioned_folio(folio, TTU_IGNORE_MLOCK);
>>>> +
>>>> +            if (folio_test_large(folio))
>>>> +                pfn = folio_pfn(folio) + folio_nr_pages(folio) - 1;
>>>>                continue;
>>>>            }
>>>> +        if (PageHuge(page)) {
>>>> +            pfn = page_to_pfn(head) + compound_nr(head) - 1;
>>>> +            isolate_hugetlb(folio, &source);
>>>> +            continue;
>>>> +        } else if (PageTransHuge(page))
> 
> If the page is a tail page, we will BUG_ON(DEBUG_VM enabled) here, but
> it seems that we don't guarantee the page won't be a tail page.

Maybe at some point we might want to remove these sanity checks or have 
explicit, expected-to-be-racy folio functions.

Like folio_test_hugetlb_racy(), folio_test_large_racy(), 
folio_nr_pages_racy().

Because the VM_DEBUG checks for folio_test_large() etc. actually make 
sense in other context where we know that concurrent splitting is 
impossible.

But maybe part of the puzzle will be in the future that we want to do a 
RCU read lock here and perform freeing/splitting under RCU, when we'll 
also have to alloc/free the "struct folio".

> 
>>>> +            pfn = page_to_pfn(head) + thp_nr_pages(page) - 1;
> 
> thp_nr_pages() need a head page, I think it should use head here, so we
> can directly use folio_nr_pages().
> 
>>>
>>> If we can use a folio in the PageHWPoison() case, can we use one here
>>> as well? I know that it's all unreliable when not holding a folio
>>> reference, and we have to be a bit careful.
>>
>> Using a folio here is part of patch4, I want to unify hugetlb/thp(or
>> large folio) with "pfn = folio_pfn(folio) + folio_nr_pages(folio) - 1"
>> when large folio after get a ref.
> 
> Think it again, even the folio don't hold a ref(splitting concurrently
> or something else), folio_nr_pages return incorrect, it won't cause
> issue since we will loop and find movable pages again in
> scan_movable_pages() and try to isolate pages, so directly use
> 
> if (folio_test_large(folio)) {
> 	pfn = folio_pfn(folio) + folio_nr_pages(folio) - 1;
> 	if (folio_test_hugetlb(folio))
> 		isolate_hugetlb(folio, &source);
> }

Likely we should add a comment here that a large folio might get split 
concurrently and that folio_nr_pages() might read garbage. But out loop 
should handle that and we would revisit the split folio later.


-- 
Cheers,

David / dhildenb



  reply	other threads:[~2024-08-06  9:24 UTC|newest]

Thread overview: 26+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2024-07-25  1:16 [PATCH 0/4] mm: memory_hotplug: improve do_migrate_range() Kefeng Wang
2024-07-25  1:16 ` [PATCH 1/4] mm: memory-failure: add unmap_posioned_folio() Kefeng Wang
2024-07-30 10:20   ` David Hildenbrand
2024-07-31  4:46     ` Kefeng Wang
2024-07-25  1:16 ` [PATCH 2/4] mm: memory_hotplug: check hwpoisoned page firstly in do_migrate_range() Kefeng Wang
2024-07-30 10:26   ` David Hildenbrand
2024-07-31  5:09     ` Kefeng Wang
2024-08-01 20:10       ` David Hildenbrand
2024-08-02  7:50         ` Kefeng Wang
2024-08-06  9:29           ` David Hildenbrand
     [not found]             ` <1e6cccc5-fedc-8df6-1deb-16ceb52a4094@huawei.com>
     [not found]               ` <1e14d86d-0d17-41da-9400-16c9c6f93f8f@redhat.com>
2024-08-09  2:02                 ` Miaohe Lin
2024-08-01 20:14   ` David Hildenbrand
2024-08-02  8:02     ` Kefeng Wang
2024-08-06  3:44       ` Kefeng Wang
2024-08-06  9:24         ` David Hildenbrand [this message]
2024-08-06  9:15       ` David Hildenbrand
2024-07-25  1:16 ` [PATCH 3/4] mm: migrate: add isolate_folio_to_list() Kefeng Wang
2024-07-26 14:21   ` kernel test robot
2024-07-27  7:56     ` Kefeng Wang
2024-07-30 10:30   ` David Hildenbrand
2024-07-25  1:16 ` [PATCH 4/4] mm: memory_hotplug: unify Huge/LRU/non-LRU movable folio isolation Kefeng Wang
2024-07-30 10:31   ` David Hildenbrand
2024-07-31  5:13     ` Kefeng Wang
2024-08-01 20:16       ` David Hildenbrand
2024-08-01 20:23   ` David Hildenbrand
2024-08-02  8:39     ` Kefeng Wang

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=cbdc4aea-3b1b-4286-aab2-10d01b192080@redhat.com \
    --to=david@redhat.com \
    --cc=akpm@linux-foundation.org \
    --cc=linmiaohe@huawei.com \
    --cc=linux-mm@kvack.org \
    --cc=nao.horiguchi@gmail.com \
    --cc=osalvador@suse.de \
    --cc=wangkefeng.wang@huawei.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox