From: Yosry Ahmed <yosry.ahmed@linux.dev>
To: Lorenzo Stoakes <lorenzo.stoakes@oracle.com>
Cc: Andrew Morton <akpm@linux-foundation.org>,
Christian Borntraeger <borntraeger@linux.ibm.com>,
Janosch Frank <frankja@linux.ibm.com>,
Claudio Imbrenda <imbrenda@linux.ibm.com>,
David Hildenbrand <david@redhat.com>,
Alexander Gordeev <agordeev@linux.ibm.com>,
Gerald Schaefer <gerald.schaefer@linux.ibm.com>,
Heiko Carstens <hca@linux.ibm.com>,
Vasily Gorbik <gor@linux.ibm.com>,
Sven Schnelle <svens@linux.ibm.com>, Zi Yan <ziy@nvidia.com>,
Baolin Wang <baolin.wang@linux.alibaba.com>,
"Liam R . Howlett" <Liam.Howlett@oracle.com>,
Nico Pache <npache@redhat.com>,
Ryan Roberts <ryan.roberts@arm.com>, Dev Jain <dev.jain@arm.com>,
Barry Song <baohua@kernel.org>,
Lance Yang <lance.yang@linux.dev>,
Kemeng Shi <shikemeng@huaweicloud.com>,
Kairui Song <kasong@tencent.com>, Nhat Pham <nphamcs@gmail.com>,
Baoquan He <bhe@redhat.com>, Chris Li <chrisl@kernel.org>,
Peter Xu <peterx@redhat.com>,
Matthew Wilcox <willy@infradead.org>,
Jason Gunthorpe <jgg@ziepe.ca>,
Leon Romanovsky <leon@kernel.org>,
Muchun Song <muchun.song@linux.dev>,
Oscar Salvador <osalvador@suse.de>,
Vlastimil Babka <vbabka@suse.cz>,
Mike Rapoport <rppt@kernel.org>,
Suren Baghdasaryan <surenb@google.com>,
Michal Hocko <mhocko@suse.com>, Jann Horn <jannh@google.com>,
Matthew Brost <matthew.brost@intel.com>,
Joshua Hahn <joshua.hahnjy@gmail.com>,
Rakie Kim <rakie.kim@sk.com>, Byungchul Park <byungchul@sk.com>,
Gregory Price <gourry@gourry.net>,
Ying Huang <ying.huang@linux.alibaba.com>,
Alistair Popple <apopple@nvidia.com>,
Pedro Falcato <pfalcato@suse.de>,
Pasha Tatashin <pasha.tatashin@soleen.com>,
Rik van Riel <riel@surriel.com>,
Harry Yoo <harry.yoo@oracle.com>,
kvm@vger.kernel.org, linux-s390@vger.kernel.org,
linux-kernel@vger.kernel.org, linux-fsdevel@vger.kernel.org,
linux-mm@kvack.org
Subject: Re: [RFC PATCH 00/12] remove is_swap_[pte, pmd]() + non-swap confusion
Date: Fri, 24 Oct 2025 20:05:33 +0000 [thread overview]
Message-ID: <7tjpibvbt2nwkkrzcbrsw3t3ehxckjrro6vxqukh4ld4memodx@cxfpmwbr3fo6> (raw)
In-Reply-To: <cover.1761288179.git.lorenzo.stoakes@oracle.com>
On Fri, Oct 24, 2025 at 08:41:16AM +0100, Lorenzo Stoakes wrote:
> There's an established convention in the kernel that we treat leaf page
> tables (so far at the PTE, PMD level) as containing 'swap entries' should
> they be neither empty (i.e. p**_none() evaluating true) nor present
> (i.e. p**_present() evaluating true).
>
> However, at the same time we also have helper predicates - is_swap_pte(),
> is_swap_pmd() - which are inconsistently used.
>
> This is problematic, as it is logical to assume that should somebody wish
> to operate upon a page table swap entry they should first check to see if
> it is in fact one.
>
> It also implies that perhaps, in future, we might introduce a non-present,
> none page table entry that is not a swap entry.
>
> This series resolves this issue by systematically eliminating all use of
> the is_swap_pte() and is swap_pmd() predicates so we retain only the
> convention that should a leaf page table entry be neither none nor present
> it is a swap entry.
>
> We also have the further issue that 'swap entry' is unfortunately a really
> rather overloaded term and in fact refers to both entries for swap and for
> other information such as migration entries, page table markers, and device
> private entries.
>
> We therefore have the rather 'unique' concept of a 'non-swap' swap entry.
>
> This is deeply confusing, so this series goes further and eliminates the
> non_swap_entry() predicate, replacing it with is_non_present_entry() - with
> an eye to a new convention of referring to these non-swap 'swap entries' as
> non-present.
I just wanted to say THANK YOU for doing this. It is indeed a very
annoying and confusing convention, and I wanted to do something about it
in the past but never got around to it..
>
> It also introduces the is_swap_entry() predicate to explicitly and
> logically refer to actual 'true' swap entries, improving code readibility,
> avoiding the hideous convention of:
>
> if (!non_swap_entry(entry)) {
> ...
> }
>
> As part of these changes we also introduce a few other new predicates:
>
> * pte_to_swp_entry_or_zero() - allows for convenient conversion from a PTE
> to a swap entry if present, or an empty swap entry if none. This is
> useful as many swap entry conversions are simply checking for flags for
> which this suffices.
>
> * get_pte_swap_entry() - Retrieves a PTE swap entry if it truly is a swap
> entry (i.e. not a non-present entry), returning true if so, otherwise
> returns false. This simplifies a lot of logic that previously open-coded
> this.
>
> * is_huge_pmd() - Determines if a PMD contains either a present transparent
> huge page entry or a huge non-present entry. This again simplifies a lot
> of logic that simply open-coded this.
>
> REVIEWERS NOTE:
>
> This series applies against mm-unstable as there are currently conflicts
> with mm-new. Should the series receive community assent I will resolve
> these at the point the RFC tag is removed.
>
> I also intend to use this as a foundation for further work to add higher
> order page table markers.
>
> Lorenzo Stoakes (12):
> mm: introduce and use pte_to_swp_entry_or_zero()
> mm: avoid unnecessary uses of is_swap_pte()
> mm: introduce get_pte_swap_entry() and use it
> mm: use get_pte_swap_entry() in debug pgtable + remove is_swap_pte()
> fs/proc/task_mmu: refactor pagemap_pmd_range()
> mm: avoid unnecessary use of is_swap_pmd()
> mm: introduce is_huge_pmd() and use where appropriate
> mm/huge_memory: refactor copy_huge_pmd() non-present logic
> mm/huge_memory: refactor change_huge_pmd() non-present logic
> mm: remove remaining is_swap_pmd() users and is_swap_pmd()
> mm: rename non_swap_entry() to is_non_present_entry()
> mm: provide is_swap_entry() and use it
>
> arch/s390/mm/gmap_helpers.c | 2 +-
> arch/s390/mm/pgtable.c | 2 +-
> fs/proc/task_mmu.c | 214 ++++++++++++++++++++--------------
> include/linux/huge_mm.h | 49 +++++---
> include/linux/swapops.h | 99 ++++++++++++++--
> include/linux/userfaultfd_k.h | 16 +--
> mm/debug_vm_pgtable.c | 43 ++++---
> mm/filemap.c | 2 +-
> mm/hmm.c | 2 +-
> mm/huge_memory.c | 189 ++++++++++++++++--------------
> mm/hugetlb.c | 6 +-
> mm/internal.h | 12 +-
> mm/khugepaged.c | 29 ++---
> mm/madvise.c | 14 +--
> mm/memory.c | 62 +++++-----
> mm/migrate.c | 2 +-
> mm/mincore.c | 2 +-
> mm/mprotect.c | 45 ++++---
> mm/mremap.c | 9 +-
> mm/page_table_check.c | 25 ++--
> mm/page_vma_mapped.c | 30 +++--
> mm/swap_state.c | 5 +-
> mm/swapfile.c | 3 +-
> mm/userfaultfd.c | 2 +-
> 24 files changed, 511 insertions(+), 353 deletions(-)
>
> --
> 2.51.0
next prev parent reply other threads:[~2025-10-24 20:05 UTC|newest]
Thread overview: 47+ messages / expand[flat|nested] mbox.gz Atom feed top
2025-10-24 7:41 Lorenzo Stoakes
2025-10-24 7:41 ` [RFC PATCH 01/12] mm: introduce and use pte_to_swp_entry_or_zero() Lorenzo Stoakes
2025-10-24 7:41 ` [RFC PATCH 02/12] mm: avoid unnecessary uses of is_swap_pte() Lorenzo Stoakes
2025-10-24 7:41 ` [RFC PATCH 03/12] mm: introduce get_pte_swap_entry() and use it Lorenzo Stoakes
2025-10-24 7:41 ` [RFC PATCH 04/12] mm: use get_pte_swap_entry() in debug pgtable + remove is_swap_pte() Lorenzo Stoakes
2025-10-24 7:41 ` [RFC PATCH 05/12] fs/proc/task_mmu: refactor pagemap_pmd_range() Lorenzo Stoakes
2025-10-24 17:32 ` Gregory Price
2025-10-24 18:19 ` Lorenzo Stoakes
2025-10-24 19:12 ` Gregory Price
2025-10-24 20:15 ` Lorenzo Stoakes
2025-10-24 20:37 ` Gregory Price
2025-10-27 15:26 ` Lorenzo Stoakes
2025-10-27 16:11 ` Jason Gunthorpe
2025-10-27 16:15 ` David Hildenbrand
2025-10-27 16:26 ` Lorenzo Stoakes
2025-10-27 16:31 ` David Hildenbrand
2025-10-27 16:38 ` Lorenzo Stoakes
2025-10-27 17:08 ` Alexander Gordeev
2025-10-28 12:52 ` Jason Gunthorpe
2025-10-28 13:09 ` Gregory Price
2025-10-28 17:36 ` Lorenzo Stoakes
2025-10-28 18:23 ` Lorenzo Stoakes
2025-10-27 16:38 ` Gregory Price
2025-10-24 7:41 ` [RFC PATCH 06/12] mm: avoid unnecessary use of is_swap_pmd() Lorenzo Stoakes
2025-10-24 7:41 ` [RFC PATCH 07/12] mm: introduce is_huge_pmd() and use where appropriate Lorenzo Stoakes
2025-10-24 7:41 ` [RFC PATCH 08/12] mm/huge_memory: refactor copy_huge_pmd() non-present logic Lorenzo Stoakes
2025-10-24 7:41 ` [RFC PATCH 09/12] mm/huge_memory: refactor change_huge_pmd() " Lorenzo Stoakes
2025-10-24 18:41 ` Gregory Price
2025-10-24 18:44 ` Lorenzo Stoakes
2025-10-24 19:09 ` Gregory Price
2025-10-24 7:41 ` [RFC PATCH 10/12] mm: remove remaining is_swap_pmd() users and is_swap_pmd() Lorenzo Stoakes
2025-10-24 7:41 ` [RFC PATCH 11/12] mm: rename non_swap_entry() to is_non_present_entry() Lorenzo Stoakes
2025-10-24 19:07 ` Gregory Price
2025-10-24 20:17 ` Lorenzo Stoakes
2025-10-24 7:41 ` [RFC PATCH 12/12] mm: provide is_swap_entry() and use it Lorenzo Stoakes
2025-10-24 20:05 ` Yosry Ahmed [this message]
2025-10-24 20:14 ` [RFC PATCH 00/12] remove is_swap_[pte, pmd]() + non-swap confusion Lorenzo Stoakes
2025-10-27 16:09 ` Jason Gunthorpe
2025-10-27 17:33 ` Lorenzo Stoakes
2025-10-28 12:48 ` Jason Gunthorpe
2025-10-28 18:20 ` Lorenzo Stoakes
2025-10-29 14:10 ` Jason Gunthorpe
2025-10-29 19:09 ` Lorenzo Stoakes
2025-10-29 21:23 ` Gregory Price
2025-10-30 10:21 ` Lorenzo Stoakes
2025-11-02 14:27 ` Lorenzo Stoakes
2025-10-27 23:32 ` Gregory Price
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=7tjpibvbt2nwkkrzcbrsw3t3ehxckjrro6vxqukh4ld4memodx@cxfpmwbr3fo6 \
--to=yosry.ahmed@linux.dev \
--cc=Liam.Howlett@oracle.com \
--cc=agordeev@linux.ibm.com \
--cc=akpm@linux-foundation.org \
--cc=apopple@nvidia.com \
--cc=baohua@kernel.org \
--cc=baolin.wang@linux.alibaba.com \
--cc=bhe@redhat.com \
--cc=borntraeger@linux.ibm.com \
--cc=byungchul@sk.com \
--cc=chrisl@kernel.org \
--cc=david@redhat.com \
--cc=dev.jain@arm.com \
--cc=frankja@linux.ibm.com \
--cc=gerald.schaefer@linux.ibm.com \
--cc=gor@linux.ibm.com \
--cc=gourry@gourry.net \
--cc=harry.yoo@oracle.com \
--cc=hca@linux.ibm.com \
--cc=imbrenda@linux.ibm.com \
--cc=jannh@google.com \
--cc=jgg@ziepe.ca \
--cc=joshua.hahnjy@gmail.com \
--cc=kasong@tencent.com \
--cc=kvm@vger.kernel.org \
--cc=lance.yang@linux.dev \
--cc=leon@kernel.org \
--cc=linux-fsdevel@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=linux-s390@vger.kernel.org \
--cc=lorenzo.stoakes@oracle.com \
--cc=matthew.brost@intel.com \
--cc=mhocko@suse.com \
--cc=muchun.song@linux.dev \
--cc=npache@redhat.com \
--cc=nphamcs@gmail.com \
--cc=osalvador@suse.de \
--cc=pasha.tatashin@soleen.com \
--cc=peterx@redhat.com \
--cc=pfalcato@suse.de \
--cc=rakie.kim@sk.com \
--cc=riel@surriel.com \
--cc=rppt@kernel.org \
--cc=ryan.roberts@arm.com \
--cc=shikemeng@huaweicloud.com \
--cc=surenb@google.com \
--cc=svens@linux.ibm.com \
--cc=vbabka@suse.cz \
--cc=willy@infradead.org \
--cc=ying.huang@linux.alibaba.com \
--cc=ziy@nvidia.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox