linux-mm.kvack.org archive mirror
 help / color / mirror / Atom feed
* [PATCH v4 0/1] Identify the accurate NUMA ID of CFMW
@ 2026-02-13  6:03 Cui Chao
  2026-02-13  6:03 ` [PATCH v4 1/1] mm: numa_memblks: " Cui Chao
  2026-02-14  8:06 ` [PATCH v4 0/1] " Mike Rapoport
  0 siblings, 2 replies; 3+ messages in thread
From: Cui Chao @ 2026-02-13  6:03 UTC (permalink / raw)
  To: Andrew Morton
  Cc: Jonathan Cameron, Mike Rapoport, Wang Yinfeng, dan.j.williams,
	Pratyush Brahma, Gregory Price, David Hildenbrand, linux-cxl,
	linux-kernel, linux-mm, qemu-devel

Changes in v4:
- Condensed the impact description in changelog.
- Added Gregory's Reviewed-by tag.
- Added Dan's Reviewed-by tag.
- Added Cc: <stable@vger.kernel.org> tag

Cui Chao (1):
  mm: numa_memblks: Identify the accurate NUMA ID of CFMW

 mm/numa_memblks.c | 9 +++++----
 1 file changed, 5 insertions(+), 4 deletions(-)

-- 
2.33.0



^ permalink raw reply	[flat|nested] 3+ messages in thread

* [PATCH v4 1/1] mm: numa_memblks: Identify the accurate NUMA ID of CFMW
  2026-02-13  6:03 [PATCH v4 0/1] Identify the accurate NUMA ID of CFMW Cui Chao
@ 2026-02-13  6:03 ` Cui Chao
  2026-02-14  8:06 ` [PATCH v4 0/1] " Mike Rapoport
  1 sibling, 0 replies; 3+ messages in thread
From: Cui Chao @ 2026-02-13  6:03 UTC (permalink / raw)
  To: Andrew Morton
  Cc: Jonathan Cameron, Mike Rapoport, Wang Yinfeng, dan.j.williams,
	Pratyush Brahma, Gregory Price, David Hildenbrand, linux-cxl,
	linux-kernel, linux-mm, qemu-devel, stable, Jonathan Cameron

In some physical memory layout designs, the address space of CFMW (CXL
Fixed Memory Window) resides between multiple segments of system memory
belonging to the same NUMA node. In numa_cleanup_meminfo, these multiple
segments of system memory are merged into a larger numa_memblk. When
identifying which NUMA node the CFMW belongs to, it may be incorrectly
assigned to the NUMA node of the merged system memory.

When a CXL RAM region is created in userspace, the memory capacity of
the newly created region is not added to the CFMW-dedicated NUMA node.
Instead, it is accumulated into an existing NUMA node (e.g., NUMA0
containing RAM). This makes it impossible to clearly distinguish
between the two types of memory, which may affect memory-tiering
applications.

Example memory layout:

Physical address space:
    0x00000000 - 0x1FFFFFFF  System RAM (node0)
    0x20000000 - 0x2FFFFFFF  CXL CFMW (node2)
    0x40000000 - 0x5FFFFFFF  System RAM (node0)
    0x60000000 - 0x7FFFFFFF  System RAM (node1)

After numa_cleanup_meminfo, the two node0 segments are merged into one:
    0x00000000 - 0x5FFFFFFF  System RAM (node0) // CFMW is inside the range
    0x60000000 - 0x7FFFFFFF  System RAM (node1)

So the CFMW (0x20000000-0x2FFFFFFF) will be incorrectly assigned to node0.

To address this scenario, accurately identifying the correct NUMA node
can be achieved by checking whether the region belongs to both
numa_meminfo and numa_reserved_meminfo.

While this issue is only observed in a QEMU configuration, and no known
end users are impacted by this problem, it is likely that some firmware
implementation is leaving memory map holes in a CXL Fixed Memory Window.
CXL hotplug depends on mapping free window capacity, and it seems to be
only a coincidence to have not hit this problem yet.

Fixes: 779dd20cfb56 ("cxl/region: Add region creation support")
Signed-off-by: Cui Chao <cuichao1753@phytium.com.cn>
Cc: <stable@vger.kernel.org>
Reviewed-by: Jonathan Cameron <jonathan.cameron@huawei.com>
Reviewed-by: Gregory Price <gourry@gourry.net>
Reviewed-by: Dan Williams <dan.j.williams@intel.com>
---
 mm/numa_memblks.c | 9 +++++----
 1 file changed, 5 insertions(+), 4 deletions(-)

diff --git a/mm/numa_memblks.c b/mm/numa_memblks.c
index 5b009a9cd8b4..0892d532908c 100644
--- a/mm/numa_memblks.c
+++ b/mm/numa_memblks.c
@@ -568,15 +568,16 @@ static int meminfo_to_nid(struct numa_meminfo *mi, u64 start)
 int phys_to_target_node(u64 start)
 {
 	int nid = meminfo_to_nid(&numa_meminfo, start);
+	int reserved_nid = meminfo_to_nid(&numa_reserved_meminfo, start);
 
 	/*
-	 * Prefer online nodes, but if reserved memory might be
-	 * hot-added continue the search with reserved ranges.
+	 * Prefer online nodes unless the address is also described
+	 * by reserved ranges, in which case use the reserved nid.
 	 */
-	if (nid != NUMA_NO_NODE)
+	if (nid != NUMA_NO_NODE && reserved_nid == NUMA_NO_NODE)
 		return nid;
 
-	return meminfo_to_nid(&numa_reserved_meminfo, start);
+	return reserved_nid;
 }
 EXPORT_SYMBOL_GPL(phys_to_target_node);
 
-- 
2.33.0



^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: [PATCH v4 0/1] Identify the accurate NUMA ID of CFMW
  2026-02-13  6:03 [PATCH v4 0/1] Identify the accurate NUMA ID of CFMW Cui Chao
  2026-02-13  6:03 ` [PATCH v4 1/1] mm: numa_memblks: " Cui Chao
@ 2026-02-14  8:06 ` Mike Rapoport
  1 sibling, 0 replies; 3+ messages in thread
From: Mike Rapoport @ 2026-02-14  8:06 UTC (permalink / raw)
  To: Andrew Morton, Cui Chao
  Cc: Mike Rapoport, Jonathan Cameron, Wang Yinfeng, dan.j.williams,
	Pratyush Brahma, Gregory Price, David Hildenbrand, linux-cxl,
	linux-kernel, linux-mm, qemu-devel

On Fri, 13 Feb 2026 14:03:46 +0800, Cui Chao wrote:
> Changes in v4:
> - Condensed the impact description in changelog.
> - Added Gregory's Reviewed-by tag.
> - Added Dan's Reviewed-by tag.
> - Added Cc: <stable@vger.kernel.org> tag
> 
> Cui Chao (1):
>   mm: numa_memblks: Identify the accurate NUMA ID of CFMW
> 
> [...]

Applied to fixes branch of memblock.git tree, thanks!

[1/1] mm: numa_memblks: Identify the accurate NUMA ID of CFMW
      commit: f043a93fff9e3e3e648b6525483f59104b0819fa

tree: https://git.kernel.org/pub/scm/linux/kernel/git/rppt/memblock
branch: fixes

--
Sincerely yours,
Mike.



^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2026-02-14  8:06 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2026-02-13  6:03 [PATCH v4 0/1] Identify the accurate NUMA ID of CFMW Cui Chao
2026-02-13  6:03 ` [PATCH v4 1/1] mm: numa_memblks: " Cui Chao
2026-02-14  8:06 ` [PATCH v4 0/1] " Mike Rapoport

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox