linux-mm.kvack.org archive mirror
 help / color / mirror / Atom feed
From: Kefeng Wang <wangkefeng.wang@huawei.com>
To: "Luck, Tony" <tony.luck@intel.com>,
	"akpm@linux-foundation.org" <akpm@linux-foundation.org>,
	"linux-mm@kvack.org" <linux-mm@kvack.org>
Cc: Miaohe Lin <linmiaohe@huawei.com>,
	"nao.horiguchi@gmail.com" <nao.horiguchi@gmail.com>,
	Matthew Wilcox <willy@infradead.org>,
	David Hildenbrand <david@redhat.com>,
	Muchun Song <muchun.song@linux.dev>,
	Benjamin LaHaise <bcrl@kvack.org>,
	"jglisse@redhat.com" <jglisse@redhat.com>,
	Zi Yan <ziy@nvidia.com>, Jiaqi Yan <jiaqiyan@google.com>,
	Hugh Dickins <hughd@google.com>,
	Vishal Moola <vishal.moola@gmail.com>,
	Alistair Popple <apopple@nvidia.com>,
	"chu, jane" <jane.chu@oracle.com>,
	Oscar Salvador <osalvador@suse.de>
Subject: Re: [PATCH v3 4/5] fs: hugetlbfs: support poison recover from hugetlbfs_migrate_folio()
Date: Wed, 29 May 2024 10:48:10 +0800	[thread overview]
Message-ID: <2b455ec8-e776-488d-bbdc-3023f3908c18@huawei.com> (raw)
In-Reply-To: <SJ1PR11MB6083354CE70F67B18685BE30FCF12@SJ1PR11MB6083.namprd11.prod.outlook.com>



On 2024/5/28 23:41, Luck, Tony wrote:
> +	if (unlikely(folio_mc_copy(dst, src))) {
> +		folio_ref_unfreeze(src, expected_count);
> +		return -EFAULT;
> 
> It doesn't look like any code takes action to avoid re-using the poisoned page.
> 
> So you survived, hurrah! But left the problem page for some other code to trip over.

Hi Tony, thanks for your review,

We tried to avoid calling memory_failure_queue() after 
copy_mc_{user_}highpage(), and I think the memory_failure() should be
called by  ARCH's code(eg, mce in x86)[1] to handle the poisoned page,
but for current mainline, the x86 mce don't do that, so yes, we need a
memory_failure_queue() for x86, but it is not true for upcoming
arm64, the poisoned page is handled by apei_claim_sea(),and a new
memory_failure_queue() is unnecessary(no issue since the
TestSetPageHWPoison() check in memory_failure()).

It seems that the khugepaged[3][4] should do the same thing, we could
call memory_failure_queue() in copy_mc_{user_}highpage(), and remove
it from each caller, is that OK?

diff --git a/include/linux/highmem.h b/include/linux/highmem.h
index 00341b56d291..6b0d6f3c8580 100644
--- a/include/linux/highmem.h
+++ b/include/linux/highmem.h
@@ -352,6 +352,9 @@ static inline int copy_mc_user_highpage(struct page 
*to, struct page *from,
         kunmap_local(vto);
         kunmap_local(vfrom);

+       if (ret)
+               memory_failure_queue(page_to_pfn(from), 0);
+
         return ret;
  }

@@ -368,6 +371,9 @@ static inline int copy_mc_highpage(struct page *to, 
struct page *from)
         kunmap_local(vto);
         kunmap_local(vfrom);

+       if (ret)
+               memory_failure_queue(page_to_pfn(from), 0);
+
         return ret;
  }

Thanks.




[1] 
https://lore.kernel.org/linux-mm/20240204082627.3892816-3-tongtiangen@huawei.com/
[2] 
https://lore.kernel.org/linux-mm/20240528085915.1955987-1-tongtiangen@huawei.com/
[3] 12904d953364 mm/khugepaged: recover from poisoned file-backed memory
[4] 6efc7afb5cc9 mm/hwpoison: introduce copy_mc_highpage



> 
> -Tony


  parent reply	other threads:[~2024-05-29  2:48 UTC|newest]

Thread overview: 18+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2024-05-28 13:45 [PATCH v3 0/5] mm: migrate: support poison recover from migrate folio Kefeng Wang
2024-05-28 13:45 ` [PATCH v3 1/5] mm: add folio_mc_copy() Kefeng Wang
2024-05-28 14:17   ` Lance Yang
2024-05-29  2:16     ` Kefeng Wang
2024-05-28 20:13   ` Jane Chu
2024-05-29  2:16     ` Kefeng Wang
2024-05-28 13:45 ` [PATCH v3 2/5] mm: migrate: split folio_migrate_mapping() Kefeng Wang
2024-05-28 13:45 ` [PATCH v3 3/5] mm: migrate: support poisoned recover from migrate folio Kefeng Wang
2024-05-28 20:15   ` Jane Chu
2024-05-29  2:19     ` Kefeng Wang
2024-05-28 13:45 ` [PATCH v3 4/5] fs: hugetlbfs: support poison recover from hugetlbfs_migrate_folio() Kefeng Wang
2024-05-28 15:41   ` Luck, Tony
2024-05-28 20:19     ` Jane Chu
2024-05-28 20:30       ` Luck, Tony
2024-05-28 22:02         ` Jane Chu
2024-05-28 22:10           ` Luck, Tony
2024-05-29  2:48     ` Kefeng Wang [this message]
2024-05-28 13:45 ` [PATCH v3 5/5] mm: migrate: remove folio_migrate_copy() Kefeng Wang

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=2b455ec8-e776-488d-bbdc-3023f3908c18@huawei.com \
    --to=wangkefeng.wang@huawei.com \
    --cc=akpm@linux-foundation.org \
    --cc=apopple@nvidia.com \
    --cc=bcrl@kvack.org \
    --cc=david@redhat.com \
    --cc=hughd@google.com \
    --cc=jane.chu@oracle.com \
    --cc=jglisse@redhat.com \
    --cc=jiaqiyan@google.com \
    --cc=linmiaohe@huawei.com \
    --cc=linux-mm@kvack.org \
    --cc=muchun.song@linux.dev \
    --cc=nao.horiguchi@gmail.com \
    --cc=osalvador@suse.de \
    --cc=tony.luck@intel.com \
    --cc=vishal.moola@gmail.com \
    --cc=willy@infradead.org \
    --cc=ziy@nvidia.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox