From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id 56D48C3DA59 for ; Fri, 19 Jul 2024 13:33:58 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id C2C9E6B0088; Fri, 19 Jul 2024 09:33:57 -0400 (EDT) Received: by kanga.kvack.org (Postfix, from userid 40) id BB5876B0089; Fri, 19 Jul 2024 09:33:57 -0400 (EDT) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id A55DC6B008C; Fri, 19 Jul 2024 09:33:57 -0400 (EDT) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0010.hostedemail.com [216.40.44.10]) by kanga.kvack.org (Postfix) with ESMTP id 7F31D6B0088 for ; Fri, 19 Jul 2024 09:33:57 -0400 (EDT) Received: from smtpin21.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay06.hostedemail.com (Postfix) with ESMTP id 3B5FCA212D for ; Fri, 19 Jul 2024 13:33:57 +0000 (UTC) X-FDA: 82356595314.21.4CAE589 Received: from frasgout.his.huawei.com (frasgout.his.huawei.com [185.176.79.56]) by imf14.hostedemail.com (Postfix) with ESMTP id 9BA49100014 for ; Fri, 19 Jul 2024 13:33:53 +0000 (UTC) Authentication-Results: imf14.hostedemail.com; dkim=none; dmarc=pass (policy=quarantine) header.from=huawei.com; spf=pass (imf14.hostedemail.com: domain of jonathan.cameron@huawei.com designates 185.176.79.56 as permitted sender) smtp.mailfrom=jonathan.cameron@huawei.com ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1721395987; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=bxy8NgqkbcMfeyUcfjsZgf35rR5jihtVzo89MmJPvD4=; b=xUM4/8rvNiWbzV2A7B1PeZD5MaR/yKeGbGy2NODrqUQvwck7nkzTjTNc7LbuBTjNRPFJ/O 2xEFsEuFI4wmPQqEpLSS1jaE7Lb11oSMDcZd50PImLYJlvubVEMhMibO2iGexg1we1AjKm v5H1kkt9PFKmnL8m6ZKUikwsay1Yrjc= ARC-Seal: i=1; s=arc-20220608; d=hostedemail.com; t=1721395987; a=rsa-sha256; cv=none; b=lEbMSoA2th91qC4dStYGxbs0lKTTItSEFDXzvnCqRxfcv0ejViAOdKg9pIiMXQyjm1VZGn H40E+B7KvjwUWgRMJKT81emKv/c1IflK+zkm6WJlz8geDPJu6Od5I9ut3ywa/sfaz3coI1 wBMvYGoQsVU/4HOn4ziAhZpMyfHKY8U= ARC-Authentication-Results: i=1; imf14.hostedemail.com; dkim=none; dmarc=pass (policy=quarantine) header.from=huawei.com; spf=pass (imf14.hostedemail.com: domain of jonathan.cameron@huawei.com designates 185.176.79.56 as permitted sender) smtp.mailfrom=jonathan.cameron@huawei.com Received: from mail.maildlp.com (unknown [172.18.186.31]) by frasgout.his.huawei.com (SkyGuard) with ESMTP id 4WQVw90sR6z6JBj4; Fri, 19 Jul 2024 21:32:25 +0800 (CST) Received: from lhrpeml500005.china.huawei.com (unknown [7.191.163.240]) by mail.maildlp.com (Postfix) with ESMTPS id 4D842140A87; Fri, 19 Jul 2024 21:33:49 +0800 (CST) Received: from localhost (10.122.19.247) by lhrpeml500005.china.huawei.com (7.191.163.240) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.1.2507.39; Fri, 19 Jul 2024 14:33:48 +0100 Date: Fri, 19 Jul 2024 14:33:47 +0100 From: Jonathan Cameron To: Mike Rapoport CC: , Alexander Gordeev , Andreas Larsson , "Andrew Morton" , Arnd Bergmann , "Borislav Petkov" , Catalin Marinas , Christophe Leroy , Dan Williams , Dave Hansen , David Hildenbrand , "David S. Miller" , Greg Kroah-Hartman , Heiko Carstens , Huacai Chen , Ingo Molnar , Jiaxun Yang , "John Paul Adrian Glaubitz" , Michael Ellerman , Palmer Dabbelt , "Rafael J. Wysocki" , Rob Herring , "Thomas Bogendoerfer" , Thomas Gleixner , Vasily Gorbik , Will Deacon , , , , , , , , , , , , , , , Subject: Re: [PATCH 00/17] mm: introduce numa_memblks Message-ID: <20240719143347.000077d9@huawei.com> In-Reply-To: <20240716111346.3676969-1-rppt@kernel.org> References: <20240716111346.3676969-1-rppt@kernel.org> X-Mailer: Claws Mail 4.0.0 (GTK+ 3.24.29; x86_64-w64-mingw32) MIME-Version: 1.0 Content-Type: text/plain; charset="US-ASCII" Content-Transfer-Encoding: quoted-printable X-Originating-IP: [10.122.19.247] X-ClientProxiedBy: lhrpeml100002.china.huawei.com (7.191.160.241) To lhrpeml500005.china.huawei.com (7.191.163.240) X-Rspamd-Server: rspam07 X-Rspamd-Queue-Id: 9BA49100014 X-Stat-Signature: pokju5ixkzr5hwtdoe9gdrmw71eeoaux X-Rspam-User: X-HE-Tag: 1721396033-591317 X-HE-Meta: U2FsdGVkX1+9uLaztyLhj564Wq3HjVkilwKSuOxBUXxCRu5prUrPYZo+G1Y5+zDesqXo0H000qEyyVzbRMvSgqy/y2l46N9/lZ0ICTyLxo/XV+Pw7dmccYptdXa60XofEivNztw3PRrAWb/JNiLF6vi2qwqUOiY298x9bh/20NDxDdMYWsmrtXDGAPBYvBpn0FsquzR5GKokqoLB4Z6s0yKyDm5sxdqUyeFBWhqslZwSq/rRSa0NvYHpCWEoA0avHx0GGkSUt7pLvkCHZVLP/r6X2gtlE1hEwksOvAChO4W0HqWlSf8BNy/eo/HDvTaLK9Ya/g9N2RVoTFmzqDsxGLnj9SjwDeoQjhnvU1Kzd+RT/6qU+Xk/eJRnaibutFgRbPELsOFDkjJ6ogLpwtNoEktrLlr9RfiIU/1iy/h7Pg4kTZYWiT3Ue6eeWD8IHA9sGPpd13fG7yH3C+0+NlU8LRX6zMp4hY1K+lIp2aPxFR0SUZSwEMyeM5PEY34U6tzrkaakEVQZ2zRqijKtqF3coeoxM+I8FsmIhMk9vn0YHZwAaVnm+CrAnF+L0oDi+Kofb0y2Hn/DORUovms7OsW6Ka1z0aiDYW1pvA94/iJFGLwv9lnzBKbjL1Xmah5G+u+7Tew62WGx7tCUJs/MQAM9w/3yuUUoILwdJXVBl9nrNfw97uU0C+8v9czLiEefTGVIKkHfc2UqArysQnUCaeOssg83bkaLoclmuNusBZFssQf/ZvgW3kbosCW9Ty6lsLnzIO8JvSPc8qqf+U9s3YvBe1ky/sDwVfS9rz8QwQimmpr0DE3fazU+pML9e1ULRmR3voFDGMzzTHi3cKPrM4vVDDAL2ZN4TSV9JKU31cDNNXz1g5fgDE8NEVBx5OXmGQ07gq0EW4o9tcmujMIBOsNJazL2zfjfI9VBjMmgdG0vcyE3U6bMRpIae1Y3eAyw8hdrHU70zBKXbKAqBJTibq8 BhgEcCty nADu+nWFLsdIVWUXTwgmhtyClO0aCo0ZcSMNOxuU2fnU0GUJlBpKbeZ8FRp48AyyUKyn/f8uOiV9qptRPEKc3LvVIjvrLOyjUZk42MOK534cabiMzmwPV5RU5XW3D6u/nebimzD9UH04OIAueNMoXYfJiNLy9c72cIQGIXTBOJbYP/ItW/D/vieK37DDr/1D0w8xAW3NI1uVaIcsxvgSuHhQxsHWyGtyABt+lClOS6Cksb9hDXT3EnkNJWWwcc+sfiNz5pP2Vhh5ZNKI= X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: List-Subscribe: List-Unsubscribe: On Tue, 16 Jul 2024 14:13:29 +0300 Mike Rapoport wrote: > From: "Mike Rapoport (Microsoft)" >=20 > Hi, >=20 > Following the discussion about handling of CXL fixed memory windows on > arm64 [1] I decided to bite the bullet and move numa_memblks from x86 to > the generic code so they will be available on arm64/riscv and maybe on > loongarch sometime later. >=20 > While it could be possible to use memblock to describe CXL memory windows, > it currently lacks notion of unpopulated memory ranges and numa_memblks > does implement this. >=20 > Another reason to make numa_memblks generic is that both arch_numa (arm64 > and riscv) and loongarch use trimmed copy of x86 code although there is no > fundamental reason why the same code cannot be used on all these platform= s. > Having numa_memblks in mm/ will make it's interaction with ACPI and FDT > more consistent and I believe will reduce maintenance burden. >=20 > And with generic numa_memblks it is (almost) straightforward to enable NU= MA > emulation on arm64 and riscv. >=20 > The first 5 commits in this series are cleanups that are not strictly > related to numa_memblks. >=20 > Commits 6-11 slightly reorder code in x86 to allow extracting numa_memblks > and NUMA emulation to the generic code. >=20 > Commits 12-14 actually move the code from arch/x86/ to mm/ and commit 15 > does some aftermath cleanups. >=20 > Commit 16 switches arch_numa to numa_memblks. >=20 > Commit 17 enables usage of phys_to_target_node() and > memory_add_physaddr_to_nid() with numa_memblks. Hi Mike, I've lightly tested with emulated CXL + Generic Ports and Generic Initiators as well as more normal cpus and memory via qemu on arm64 and it's looking good. =46rom my earlier series, patch 4 is probably still needed to avoid presenting nodes with nothing in them at boot (but not if we hotplug memory then remove it again in which case they disappear) https://lore.kernel.org/all/20240529171236.32002-5-Jonathan.Cameron@huawei.= com/ However that was broken/inconsistent before your rework so I can send that patch separately.=20 Thanks for getting this sorted! I should get time to do more extensive testing and review in next week or so. Jonathan >=20 > [1] https://lore.kernel.org/all/20240529171236.32002-1-Jonathan.Cameron@h= uawei.com/ >=20 > Mike Rapoport (Microsoft) (17): > mm: move kernel/numa.c to mm/ > MIPS: sgi-ip27: make NODE_DATA() the same as on all other > architectures > MIPS: loongson64: rename __node_data to node_data > arch, mm: move definition of node_data to generic code > arch, mm: pull out allocation of NODE_DATA to generic code > x86/numa: simplify numa_distance allocation > x86/numa: move FAKE_NODE_* defines to numa_emu > x86/numa_emu: simplify allocation of phys_dist > x86/numa_emu: split __apicid_to_node update to a helper function > x86/numa_emu: use a helper function to get MAX_DMA32_PFN > x86/numa: numa_{add,remove}_cpu: make cpu parameter unsigned > mm: introduce numa_memblks > mm: move numa_distance and related code from x86 to numa_memblks > mm: introduce numa_emulation > mm: make numa_memblks more self-contained > arch_numa: switch over to numa_memblks > mm: make range-to-target_node lookup facility a part of numa_memblks >=20 > arch/arm64/include/asm/Kbuild | 1 + > arch/arm64/include/asm/mmzone.h | 13 - > arch/arm64/include/asm/topology.h | 1 + > arch/loongarch/include/asm/Kbuild | 1 + > arch/loongarch/include/asm/mmzone.h | 16 - > arch/loongarch/include/asm/topology.h | 1 + > arch/loongarch/kernel/numa.c | 21 - > arch/mips/include/asm/mach-ip27/mmzone.h | 1 - > .../mips/include/asm/mach-loongson64/mmzone.h | 4 - > arch/mips/loongson64/numa.c | 20 +- > arch/mips/sgi-ip27/ip27-memory.c | 2 +- > arch/powerpc/include/asm/mmzone.h | 6 - > arch/powerpc/mm/numa.c | 26 +- > arch/riscv/include/asm/Kbuild | 1 + > arch/riscv/include/asm/mmzone.h | 13 - > arch/riscv/include/asm/topology.h | 4 + > arch/s390/include/asm/Kbuild | 1 + > arch/s390/include/asm/mmzone.h | 17 - > arch/s390/kernel/numa.c | 3 - > arch/sh/include/asm/mmzone.h | 3 - > arch/sh/mm/init.c | 7 +- > arch/sh/mm/numa.c | 3 - > arch/sparc/include/asm/mmzone.h | 4 - > arch/sparc/mm/init_64.c | 11 +- > arch/x86/Kconfig | 9 +- > arch/x86/include/asm/Kbuild | 1 + > arch/x86/include/asm/mmzone.h | 6 - > arch/x86/include/asm/mmzone_32.h | 17 - > arch/x86/include/asm/mmzone_64.h | 18 - > arch/x86/include/asm/numa.h | 24 +- > arch/x86/include/asm/sparsemem.h | 9 - > arch/x86/mm/Makefile | 1 - > arch/x86/mm/amdtopology.c | 1 + > arch/x86/mm/numa.c | 618 +----------------- > arch/x86/mm/numa_internal.h | 24 - > drivers/acpi/numa/srat.c | 1 + > drivers/base/Kconfig | 1 + > drivers/base/arch_numa.c | 223 ++----- > drivers/cxl/Kconfig | 2 +- > drivers/dax/Kconfig | 2 +- > drivers/of/of_numa.c | 1 + > include/asm-generic/mmzone.h | 5 + > include/asm-generic/numa.h | 6 +- > include/linux/numa.h | 5 + > include/linux/numa_memblks.h | 58 ++ > kernel/Makefile | 1 - > kernel/numa.c | 26 - > mm/Kconfig | 11 + > mm/Makefile | 3 + > mm/numa.c | 57 ++ > {arch/x86/mm =3D> mm}/numa_emulation.c | 42 +- > mm/numa_memblks.c | 565 ++++++++++++++++ > 52 files changed, 847 insertions(+), 1070 deletions(-) > delete mode 100644 arch/arm64/include/asm/mmzone.h > delete mode 100644 arch/loongarch/include/asm/mmzone.h > delete mode 100644 arch/riscv/include/asm/mmzone.h > delete mode 100644 arch/s390/include/asm/mmzone.h > delete mode 100644 arch/x86/include/asm/mmzone.h > delete mode 100644 arch/x86/include/asm/mmzone_32.h > delete mode 100644 arch/x86/include/asm/mmzone_64.h > create mode 100644 include/asm-generic/mmzone.h > create mode 100644 include/linux/numa_memblks.h > delete mode 100644 kernel/numa.c > create mode 100644 mm/numa.c > rename {arch/x86/mm =3D> mm}/numa_emulation.c (94%) > create mode 100644 mm/numa_memblks.c >=20 >=20 > base-commit: 22a40d14b572deb80c0648557f4bd502d7e83826