From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id 77621E77188 for ; Mon, 6 Jan 2025 19:03:32 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id AE3376B0082; Mon, 6 Jan 2025 14:03:31 -0500 (EST) Received: by kanga.kvack.org (Postfix, from userid 40) id A93446B0083; Mon, 6 Jan 2025 14:03:31 -0500 (EST) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id 934106B0088; Mon, 6 Jan 2025 14:03:31 -0500 (EST) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0012.hostedemail.com [216.40.44.12]) by kanga.kvack.org (Postfix) with ESMTP id 78C716B0082 for ; Mon, 6 Jan 2025 14:03:31 -0500 (EST) Received: from smtpin12.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay08.hostedemail.com (Postfix) with ESMTP id DED55140250 for ; Mon, 6 Jan 2025 19:03:30 +0000 (UTC) X-FDA: 82977950580.12.73BCBF4 Received: from mgamail.intel.com (mgamail.intel.com [192.198.163.16]) by imf29.hostedemail.com (Postfix) with ESMTP id 255AC120018 for ; Mon, 6 Jan 2025 19:03:27 +0000 (UTC) Authentication-Results: imf29.hostedemail.com; dkim=pass header.d=intel.com header.s=Intel header.b=QWAOwH87; spf=pass (imf29.hostedemail.com: domain of dave.hansen@intel.com designates 192.198.163.16 as permitted sender) smtp.mailfrom=dave.hansen@intel.com; dmarc=pass (policy=none) header.from=intel.com ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1736190208; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references:dkim-signature; bh=Ai3gMpcE1TpweiqvBiDoZ6noaLD2R8boi8e9Hi4YXaQ=; b=wBv3YvSK4n/aERzUxhgPWE8ZijkrizILkKdoLFkYM9CRkJxl0lHEnSFwaS6uIjPuIiOkVV og05f2FyMVCKSuT27TVHs5gUQbUBFT9WK+q64BQ4p2qkRx6LJb0irIsp3dWUhMnk0JZboG WujJSnkSni25gpdGOmelwpKZyKSmxZE= ARC-Authentication-Results: i=1; imf29.hostedemail.com; dkim=pass header.d=intel.com header.s=Intel header.b=QWAOwH87; spf=pass (imf29.hostedemail.com: domain of dave.hansen@intel.com designates 192.198.163.16 as permitted sender) smtp.mailfrom=dave.hansen@intel.com; dmarc=pass (policy=none) header.from=intel.com ARC-Seal: i=1; s=arc-20220608; d=hostedemail.com; t=1736190208; a=rsa-sha256; cv=none; b=LRwi133Vz52e1A5bV9Qa+mmVb1h4++IkZkwiy72syjD0SOsTPThAoNjsDHPO/EE6N8T/46 aSbKryp8/gpnHGMpc4SEw7DPWZ6Owj6DPRXSzblB11tdS+n2qhEUBFC1qOXI2VMpW35AKK EMnX5x/jckQZA/S/EXOktg90oe4YQrc= DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1736190208; x=1767726208; h=message-id:date:mime-version:subject:to:cc:references: from:in-reply-to:content-transfer-encoding; bh=xb99DTZf6ntkZ3NTNGTbRiXhK4pSQEy30NliH/0p//8=; b=QWAOwH87wF8GRtGwTgjr9A3+//+zHVzBEVwSL3KYUBAZL1n1nPM2RvEH LlLGeaS6994jGPP9F9Brl/1ViHTg8nCB+E+LNAFC1FP7cv1T8p0pMrpd/ pFfoprO1MYt8xjxTgQko1I504tgheUX3DeLwCr4EcqTp7OvH5f9J/HZjC mVlxbP2UJItoq6pSC/82GVbpYW79NZuX9xkJkXGhWQMXKlUza/6HJbwwi 8xNNbLmWR2BEhjQeAumSyh+iUpiQ10ZjLfQa0MjwV/jxTGH4fMxgUNtry NpJKPjqsIo34qIbFWs2z0zWspPSG/MQw9MZz8kS7v3jfNkog2i8srGzzW w==; X-CSE-ConnectionGUID: NOxb3tzpQuipKjY/FU9FIA== X-CSE-MsgGUID: lf1hxhf8SZuy8Zqv18zKBg== X-IronPort-AV: E=McAfee;i="6700,10204,11307"; a="23957870" X-IronPort-AV: E=Sophos;i="6.12,293,1728975600"; d="scan'208";a="23957870" Received: from fmviesa004.fm.intel.com ([10.60.135.144]) by fmvoesa110.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 06 Jan 2025 11:03:26 -0800 X-CSE-ConnectionGUID: vReTvL8/TS+oOEo2+fndJg== X-CSE-MsgGUID: HkCuvYe1Ty2UfQrOiTdWNw== X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="6.12,293,1728975600"; d="scan'208";a="107387964" Received: from lbogdanm-mobl3.ger.corp.intel.com (HELO [10.124.220.224]) ([10.124.220.224]) by fmviesa004-auth.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 06 Jan 2025 11:03:24 -0800 Message-ID: Date: Mon, 6 Jan 2025 11:03:24 -0800 MIME-Version: 1.0 User-Agent: Mozilla Thunderbird Subject: Re: [PATCH v3 00/12] AMD broadcast TLB invalidation To: Rik van Riel , x86@kernel.org Cc: linux-kernel@vger.kernel.org, kernel-team@meta.com, dave.hansen@linux.intel.com, luto@kernel.org, peterz@infradead.org, tglx@linutronix.de, mingo@redhat.com, bp@alien8.de, hpa@zytor.com, akpm@linux-foundation.org, nadav.amit@gmail.com, zhengqi.arch@bytedance.com, linux-mm@kvack.org References: <20241230175550.4046587-1-riel@surriel.com> From: Dave Hansen Content-Language: en-US Autocrypt: addr=dave.hansen@intel.com; keydata= xsFNBE6HMP0BEADIMA3XYkQfF3dwHlj58Yjsc4E5y5G67cfbt8dvaUq2fx1lR0K9h1bOI6fC oAiUXvGAOxPDsB/P6UEOISPpLl5IuYsSwAeZGkdQ5g6m1xq7AlDJQZddhr/1DC/nMVa/2BoY 2UnKuZuSBu7lgOE193+7Uks3416N2hTkyKUSNkduyoZ9F5twiBhxPJwPtn/wnch6n5RsoXsb ygOEDxLEsSk/7eyFycjE+btUtAWZtx+HseyaGfqkZK0Z9bT1lsaHecmB203xShwCPT49Blxz VOab8668QpaEOdLGhtvrVYVK7x4skyT3nGWcgDCl5/Vp3TWA4K+IofwvXzX2ON/Mj7aQwf5W iC+3nWC7q0uxKwwsddJ0Nu+dpA/UORQWa1NiAftEoSpk5+nUUi0WE+5DRm0H+TXKBWMGNCFn c6+EKg5zQaa8KqymHcOrSXNPmzJuXvDQ8uj2J8XuzCZfK4uy1+YdIr0yyEMI7mdh4KX50LO1 pmowEqDh7dLShTOif/7UtQYrzYq9cPnjU2ZW4qd5Qz2joSGTG9eCXLz5PRe5SqHxv6ljk8mb ApNuY7bOXO/A7T2j5RwXIlcmssqIjBcxsRRoIbpCwWWGjkYjzYCjgsNFL6rt4OL11OUF37wL QcTl7fbCGv53KfKPdYD5hcbguLKi/aCccJK18ZwNjFhqr4MliQARAQABzUVEYXZpZCBDaHJp c3RvcGhlciBIYW5zZW4gKEludGVsIFdvcmsgQWRkcmVzcykgPGRhdmUuaGFuc2VuQGludGVs LmNvbT7CwXgEEwECACIFAlQ+9J0CGwMGCwkIBwMCBhUIAgkKCwQWAgMBAh4BAheAAAoJEGg1 lTBwyZKwLZUP/0dnbhDc229u2u6WtK1s1cSd9WsflGXGagkR6liJ4um3XCfYWDHvIdkHYC1t MNcVHFBwmQkawxsYvgO8kXT3SaFZe4ISfB4K4CL2qp4JO+nJdlFUbZI7cz/Td9z8nHjMcWYF IQuTsWOLs/LBMTs+ANumibtw6UkiGVD3dfHJAOPNApjVr+M0P/lVmTeP8w0uVcd2syiaU5jB aht9CYATn+ytFGWZnBEEQFnqcibIaOrmoBLu2b3fKJEd8Jp7NHDSIdrvrMjYynmc6sZKUqH2 I1qOevaa8jUg7wlLJAWGfIqnu85kkqrVOkbNbk4TPub7VOqA6qG5GCNEIv6ZY7HLYd/vAkVY E8Plzq/NwLAuOWxvGrOl7OPuwVeR4hBDfcrNb990MFPpjGgACzAZyjdmYoMu8j3/MAEW4P0z F5+EYJAOZ+z212y1pchNNauehORXgjrNKsZwxwKpPY9qb84E3O9KYpwfATsqOoQ6tTgr+1BR CCwP712H+E9U5HJ0iibN/CDZFVPL1bRerHziuwuQuvE0qWg0+0SChFe9oq0KAwEkVs6ZDMB2 P16MieEEQ6StQRlvy2YBv80L1TMl3T90Bo1UUn6ARXEpcbFE0/aORH/jEXcRteb+vuik5UGY 5TsyLYdPur3TXm7XDBdmmyQVJjnJKYK9AQxj95KlXLVO38lczsFNBFRjzmoBEACyAxbvUEhd GDGNg0JhDdezyTdN8C9BFsdxyTLnSH31NRiyp1QtuxvcqGZjb2trDVuCbIzRrgMZLVgo3upr MIOx1CXEgmn23Zhh0EpdVHM8IKx9Z7V0r+rrpRWFE8/wQZngKYVi49PGoZj50ZEifEJ5qn/H Nsp2+Y+bTUjDdgWMATg9DiFMyv8fvoqgNsNyrrZTnSgoLzdxr89FGHZCoSoAK8gfgFHuO54B lI8QOfPDG9WDPJ66HCodjTlBEr/Cwq6GruxS5i2Y33YVqxvFvDa1tUtl+iJ2SWKS9kCai2DR 3BwVONJEYSDQaven/EHMlY1q8Vln3lGPsS11vSUK3QcNJjmrgYxH5KsVsf6PNRj9mp8Z1kIG qjRx08+nnyStWC0gZH6NrYyS9rpqH3j+hA2WcI7De51L4Rv9pFwzp161mvtc6eC/GxaiUGuH BNAVP0PY0fqvIC68p3rLIAW3f97uv4ce2RSQ7LbsPsimOeCo/5vgS6YQsj83E+AipPr09Caj 0hloj+hFoqiticNpmsxdWKoOsV0PftcQvBCCYuhKbZV9s5hjt9qn8CE86A5g5KqDf83Fxqm/ vXKgHNFHE5zgXGZnrmaf6resQzbvJHO0Fb0CcIohzrpPaL3YepcLDoCCgElGMGQjdCcSQ+Ci FCRl0Bvyj1YZUql+ZkptgGjikQARAQABwsFfBBgBAgAJBQJUY85qAhsMAAoJEGg1lTBwyZKw l4IQAIKHs/9po4spZDFyfDjunimEhVHqlUt7ggR1Hsl/tkvTSze8pI1P6dGp2XW6AnH1iayn yRcoyT0ZJ+Zmm4xAH1zqKjWplzqdb/dO28qk0bPso8+1oPO8oDhLm1+tY+cOvufXkBTm+whm +AyNTjaCRt6aSMnA/QHVGSJ8grrTJCoACVNhnXg/R0g90g8iV8Q+IBZyDkG0tBThaDdw1B2l asInUTeb9EiVfL/Zjdg5VWiF9LL7iS+9hTeVdR09vThQ/DhVbCNxVk+DtyBHsjOKifrVsYep WpRGBIAu3bK8eXtyvrw1igWTNs2wazJ71+0z2jMzbclKAyRHKU9JdN6Hkkgr2nPb561yjcB8 sIq1pFXKyO+nKy6SZYxOvHxCcjk2fkw6UmPU6/j/nQlj2lfOAgNVKuDLothIxzi8pndB8Jju KktE5HJqUUMXePkAYIxEQ0mMc8Po7tuXdejgPMwgP7x65xtfEqI0RuzbUioFltsp1jUaRwQZ MTsCeQDdjpgHsj+P2ZDeEKCbma4m6Ez/YWs4+zDm1X8uZDkZcfQlD9NldbKDJEXLIjYWo1PH hYepSffIWPyvBMBTW2W5FRjJ4vLRrJSUoEfJuPQ3vW9Y73foyo/qFoURHO48AinGPZ7PC7TF vUaNOTjKedrqHkaOcqB185ahG2had0xnFsDPlx5y In-Reply-To: <20241230175550.4046587-1-riel@surriel.com> Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit X-Rspamd-Queue-Id: 255AC120018 X-Rspamd-Server: rspam12 X-Stat-Signature: wt4kgrr38xnjz8dzdzuwctpmfsknmet5 X-Rspam-User: X-HE-Tag: 1736190207-605141 X-HE-Meta: U2FsdGVkX1/zWRSpWMY4GIjQsGKgEuuOspKXi1D4LqfdxRrOgUKBkolcEISmVqj0IX5eJEUGQN+nRRndHOaOW4yCOK44VB9QNMl4742RJPWDX9iSBJHlWf6PMEfDGcbWmU4/KGfF5GQ6ynVBPQjCSrnbBe41SIRkQvcjG/+JoH1EE7nSdFqzErCweCrvAxrXlzKcZUqilrqNiZ360qu6DHywyITmVPLuUKDBC83f0UmlCQ1g1l9g7l7C6i49MpayqxU9I+AsjR9ygL3fsGc711gqT3m4+b+F0rVQXkePxPc9f9wWWEN+5zOTP40/r72IlVLQOz2O4qIHDaDgYnC8ob9y5RQD2P00+P4biy3M7ZCNrTBYocMIqHLttyqXaroFhSVaq/6JfF9nXNTjMKiXkZ2+4bctWjQNkxvuqRdxiWr0clYp13YwYRSf4jD8VP0i4eT5x2N64vsJk2QT8EWJhrZnYLzHIClil7bcuJ/HNHtML7H7Sesyahw82LmJIqrtx0FiV/hkXrk8V8JJv5K8oN5gqZ9tQyWlu6I8ugiNIOFsVevn3VEXt0qfR1eMx+x4c62l0dFscVkQlaskzvs1ktrjXn+r8QUfsPG3vGEZfe8CFhRf+nObVgauxEJcC8863xLE6We61gEfw6x4O9a3GA3EuXLCpuWFQXTxKjHREdVrHZPahwhiHLpyW5F8+fgdStVCIdKkQx7LFE6WFHhkiobcV6X/izAQ2GdVIeFYfh4h3HkE+idV9zRSSsRDvq75EcYi6jyO5vDIEbdphiyFA5c+nSSeojaLtY+8KclBzHAm9j1YR5dXdJud+uTce79L/S2RFweUlyv7uDO9B8dAFZ/KAcYTipnIaJ/5VTQA82yHVIIs/FeCWTZD3iYGS1RkV7y2kZjKb+nM3Q2sqTzZ3eEMbDV3pfBJ2aGHIzz8R4xLxv8s1YFyQT+qFiqFGWMMHPi3aE1bPlCnM8t1IPW l0/3WDyd q0tw8bjN1t/TSXexYHrrSUL84m4KCN/eEdhPDbZHgjcf0lZeLizJbybMAYY6phr7kgvaqBtpVixkpjvy2eOXdYQjR09ZUhDm8HpHq7F4eiZQSvqYTlDb9xOZh/Acl8Gf7iH98D1nompOamsOOPW+/FS620OnyLWgNCMxsKratEUop3WuV0KhVagM1YXD1N89ix9xlsmTynLw8Eyhu/lCBc2q8WgAqUXB0v9QhoF0HMJgyvju+iKZmtqNJfF137Ta3qg1D/0lu+Nt1E8FyZTswBKcFAQ== X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: List-Subscribe: List-Unsubscribe: A couple of high level things we need to address: First, I'm OK calling this approach "broadcast TLB invalidation". But I don't think the ASIDs should be called "broadcast ASIDs". I'd much rather that they are called something which makes it clear that they are from a different namespace than the existing ASIDs. After this series there will be three classes: 0: Special ASID used for the kernel, basically 1->TLB_NR_DYN_ASIDS: Allocated from private, per-cpu space. Meaningless when compared between CPUs. >TLB_NR_DYN_ASIDS: Allocated from shared, kernel-wide space. All CPUs share this space and must all agree on what the values mean. The fact that the "shared" ones are system-wide obviously allows INVLPGB to be used. The hardware feature also obviously "broadcasts" things more than plain old INVLPG did. But I don't think that makes the ASIDs "broadcast" ASIDs. It's much more important to know that they are shared across the system instead of per-cpu than the fact that the deep implementation manages them with an instruction that is "broadcast" by hardware. So can we call them "global", "shared" or "system" ASIDs, please? Second, the TLB_NR_DYN_ASIDS was picked because it's roughly the number of distinct PCIDs that the CPU can keep in the TLB at once (at least on Intel). Let's say a CPU has 6 mm's in the per-cpu ASID space and another 6 in the shared/broadcast space. At that point, PCIDs might not be doing much good because the TLB can't store entries for 12 PCIDs. Is there any comprehension in this series? Should we be indexing cpu_tlbstate.ctxs[] by a *context* number rather than by the ASID that it's running as? Last, I'm not 100% convinced we want to do this whole thing. The will-it-scale numbers are nice. But given the complexity of this, I think we need some actual, real end users to stand up and say exactly how this is important in *PRODUCTION* to them.