From: Jinjiang Tu <tujinjiang@huawei.com>
To: Matthew Wilcox <willy@infradead.org>
Cc: <akpm@linux-foundation.org>, <david@kernel.org>,
<lorenzo.stoakes@oracle.com>, <ziy@nvidia.com>,
<baolin.wang@linux.alibaba.com>, <Liam.Howlett@oracle.com>,
<npache@redhat.com>, <ryan.roberts@arm.com>, <dev.jain@arm.com>,
<baohua@kernel.org>, <lance.yang@linux.dev>,
<shardul.b@mpiricsoftware.com>, <linux-mm@kvack.org>,
<linux-fsdevel@vger.kernel.org>, <wangkefeng.wang@huawei.com>
Subject: Re: [RFC PATCH] mm/khugepaged: free empty xa_nodes when rollbacks in collapse_file
Date: Fri, 23 Jan 2026 14:51:21 +0800
Message-ID: <a1f97fe2-fa7c-4411-b2a0-d19257653863@huawei.com>
In-Reply-To: <aXB0Zcu4bIEvSSuX@casper.infradead.org>
On 2026/1/21 14:38, Matthew Wilcox wrote:
> On Wed, Jan 21, 2026 at 02:22:43PM +0800, Jinjiang Tu wrote:
>> collapse_file() calls xas_create_range() to pre-create all slots needed.
>> If collapse_file() finally fails, these pre-created slots are empty nodes
>> and aren't destroyed.
> try this instead
>
> diff --git a/fs/inode.c b/fs/inode.c
> index cff1d3af0d57..85886af1e7ab 100644
> --- a/fs/inode.c
> +++ b/fs/inode.c
> @@ -744,22 +744,18 @@ void dump_mapping(const struct address_space *mapping)
>
> void clear_inode(struct inode *inode)
> {
> - /*
> - * We have to cycle the i_pages lock here because reclaim can be in the
> - * process of removing the last page (in __filemap_remove_folio())
> - * and we must not free the mapping under it.
> - */
> - xa_lock_irq(&inode->i_data.i_pages);
> - BUG_ON(inode->i_data.nrpages);
> /*
> * Almost always, mapping_empty(&inode->i_data) here; but there are
> * two known and long-standing ways in which nodes may get left behind
> * (when deep radix-tree node allocation failed partway; or when THP
> - * collapse_file() failed). Until those two known cases are cleaned up,
> - * or a cleanup function is called here, do not BUG_ON(!mapping_empty),
> - * nor even WARN_ON(!mapping_empty).
> + * collapse_file() failed).
> + *
> + * xa_destroy() also cycles the lock for us, which is needed because
> + * reclaim can be in the process of removing the last folio (in
> + * __filemap_remove_folio()) and we must not free the mapping under it.
> */
> - xa_unlock_irq(&inode->i_data.i_pages);
> + xa_destroy(&inode->i_data.i_pages);
> + BUG_ON(inode->i_data.nrpages);
> BUG_ON(!list_empty(&inode->i_data.i_private_list));
> BUG_ON(!(inode->i_state & I_FREEING));
> BUG_ON(inode->i_state & I_CLEAR);
Hi Matthew,
This approach is much simpler. Could you please send a formal patch?