From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id D76B9C2BB85 for ; Fri, 21 Jun 2024 16:12:02 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id 357798D017F; Fri, 21 Jun 2024 12:12:02 -0400 (EDT) Received: by kanga.kvack.org (Postfix, from userid 40) id 2DF4C8D0170; Fri, 21 Jun 2024 12:12:02 -0400 (EDT) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id 0F1208D017F; Fri, 21 Jun 2024 12:12:02 -0400 (EDT) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0017.hostedemail.com [216.40.44.17]) by kanga.kvack.org (Postfix) with ESMTP id D66F28D0170 for ; Fri, 21 Jun 2024 12:12:01 -0400 (EDT) Received: from smtpin11.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay03.hostedemail.com (Postfix) with ESMTP id 5AB5DA10CF for ; Fri, 21 Jun 2024 16:12:01 +0000 (UTC) X-FDA: 82255387242.11.CB542BF Received: from mgamail.intel.com (mgamail.intel.com [198.175.65.14]) by imf06.hostedemail.com (Postfix) with ESMTP id 4DCFC18000E for ; Fri, 21 Jun 2024 16:11:58 +0000 (UTC) Authentication-Results: imf06.hostedemail.com; dkim=pass header.d=intel.com header.s=Intel header.b="D2Yj2/tQ"; dmarc=pass (policy=none) header.from=intel.com; spf=pass (imf06.hostedemail.com: domain of dave.hansen@intel.com designates 198.175.65.14 as permitted sender) smtp.mailfrom=dave.hansen@intel.com ARC-Seal: i=1; s=arc-20220608; d=hostedemail.com; t=1718986311; a=rsa-sha256; cv=none; b=G3UOXxopiosDh6ybdHBhIksyOzwx1aP4CmmaiSobSfLwequhrTgX2Q4rT0Pxoaion0+QMa 3dk5zqvnq8wr+3zoYdriE3lyJGTNnuHBSVhAV9dk8AH5xv73cQsC+xFMY//sATdr1kyJOS HhRfCRGMwap12JTmldhmnCrGBUYr1s0= ARC-Authentication-Results: i=1; imf06.hostedemail.com; dkim=pass header.d=intel.com header.s=Intel header.b="D2Yj2/tQ"; dmarc=pass (policy=none) header.from=intel.com; spf=pass (imf06.hostedemail.com: domain of dave.hansen@intel.com designates 198.175.65.14 as permitted sender) smtp.mailfrom=dave.hansen@intel.com ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1718986311; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references:dkim-signature; bh=It5wVth+j5+zxstyh+vNsR4DpZ4lrXIsArnyvR4GMZg=; b=Omvfsu1NvgZrMC8oWq5u7xH9tMsIAI55Zqqu2NSTQZuPZQRlMGTeNOlduwGPHjPkkg9pFG Dc7eDKul7ROOZhPFElfvJ6PSbdfQJlkVwKCUyZ06l4nE8jWtLMhke+3iiUzb7ihxFayf/7 g5a4LUb/FzMuBWV24fyvODgHXD+5Krg= DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1718986318; x=1750522318; h=message-id:date:mime-version:subject:to:cc:references: from:in-reply-to:content-transfer-encoding; bh=BUMfaAo/KwHaw6iSwPgLnlv+IyqhQhDWdAdRUyVOJXc=; b=D2Yj2/tQHSwe4iPq1I2vyqCqriXXnHRMME/sQgWIuEXLQUekLO2VLfKG 6l5CWIJdM8wKsb4p3mDYawVpZZsQEI3uatzt25j8iIQF18BG/9VcK4+Y5 euE07X5OcM4aLGFBBAr9DNMT+i2GvuR29x2hA5LnGMnk1qMn4kICRsaJg Y+gUpFJ5Wx/UK+IAueMeT3Sc+IKDChUT8YxreqQ6CoRdpbJdaxDqVpgMp +XcKTExHEEdkRD3O0ZYEORzJvuxVbCB1fDWntPykrU7syAi3OzXHHHq5F bHOWSTmHbIs9sgbwmnheqsRxbNZ4rzNOxEqrFFDjcG55bCDmQb7t++9+4 A==; X-CSE-ConnectionGUID: iGwlIS2BR46y8YiXlujaVA== X-CSE-MsgGUID: XZrzRbVJS0O/Vibpa4sDTg== X-IronPort-AV: E=McAfee;i="6700,10204,11110"; a="19840012" X-IronPort-AV: E=Sophos;i="6.08,255,1712646000"; d="scan'208";a="19840012" Received: from fmviesa008.fm.intel.com ([10.60.135.148]) by orvoesa106.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 21 Jun 2024 09:11:56 -0700 X-CSE-ConnectionGUID: Vq656qevQCOneX+OEz/oVw== X-CSE-MsgGUID: xTH7aQt2SYGS+mPlibylGw== X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="6.08,255,1712646000"; d="scan'208";a="42598166" Received: from bmurrell-mobl.amr.corp.intel.com (HELO [10.124.221.70]) ([10.124.221.70]) by fmviesa008-auth.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 21 Jun 2024 09:11:56 -0700 Message-ID: <5daa82f5-b5ea-42d7-a074-122734ad99db@intel.com> Date: Fri, 21 Jun 2024 09:11:56 -0700 MIME-Version: 1.0 User-Agent: Mozilla Thunderbird Subject: Re: [PATCH 6/7] mm/x86: Add missing pud helpers To: Peter Xu Cc: linux-kernel@vger.kernel.org, linux-mm@kvack.org, x86@kernel.org, Borislav Petkov , Dave Jiang , "Kirill A . Shutemov" , Ingo Molnar , Oscar Salvador , Matthew Wilcox , Vlastimil Babka , Dan Williams , Andrew Morton , Hugh Dickins , Michael Ellerman , Dave Hansen , Thomas Gleixner , linuxppc-dev@lists.ozlabs.org, Christophe Leroy , Rik van Riel , Mel Gorman , "Aneesh Kumar K . V" , Nicholas Piggin , Huang Ying , "Edgecombe, Rick P" References: <20240621142504.1940209-1-peterx@redhat.com> <20240621142504.1940209-7-peterx@redhat.com> <4fb4b087-cae2-4516-a34e-cb4c72be13eb@intel.com> From: Dave Hansen Content-Language: en-US Autocrypt: addr=dave.hansen@intel.com; keydata= xsFNBE6HMP0BEADIMA3XYkQfF3dwHlj58Yjsc4E5y5G67cfbt8dvaUq2fx1lR0K9h1bOI6fC oAiUXvGAOxPDsB/P6UEOISPpLl5IuYsSwAeZGkdQ5g6m1xq7AlDJQZddhr/1DC/nMVa/2BoY 2UnKuZuSBu7lgOE193+7Uks3416N2hTkyKUSNkduyoZ9F5twiBhxPJwPtn/wnch6n5RsoXsb ygOEDxLEsSk/7eyFycjE+btUtAWZtx+HseyaGfqkZK0Z9bT1lsaHecmB203xShwCPT49Blxz VOab8668QpaEOdLGhtvrVYVK7x4skyT3nGWcgDCl5/Vp3TWA4K+IofwvXzX2ON/Mj7aQwf5W iC+3nWC7q0uxKwwsddJ0Nu+dpA/UORQWa1NiAftEoSpk5+nUUi0WE+5DRm0H+TXKBWMGNCFn c6+EKg5zQaa8KqymHcOrSXNPmzJuXvDQ8uj2J8XuzCZfK4uy1+YdIr0yyEMI7mdh4KX50LO1 pmowEqDh7dLShTOif/7UtQYrzYq9cPnjU2ZW4qd5Qz2joSGTG9eCXLz5PRe5SqHxv6ljk8mb ApNuY7bOXO/A7T2j5RwXIlcmssqIjBcxsRRoIbpCwWWGjkYjzYCjgsNFL6rt4OL11OUF37wL QcTl7fbCGv53KfKPdYD5hcbguLKi/aCccJK18ZwNjFhqr4MliQARAQABzUVEYXZpZCBDaHJp c3RvcGhlciBIYW5zZW4gKEludGVsIFdvcmsgQWRkcmVzcykgPGRhdmUuaGFuc2VuQGludGVs LmNvbT7CwXgEEwECACIFAlQ+9J0CGwMGCwkIBwMCBhUIAgkKCwQWAgMBAh4BAheAAAoJEGg1 lTBwyZKwLZUP/0dnbhDc229u2u6WtK1s1cSd9WsflGXGagkR6liJ4um3XCfYWDHvIdkHYC1t MNcVHFBwmQkawxsYvgO8kXT3SaFZe4ISfB4K4CL2qp4JO+nJdlFUbZI7cz/Td9z8nHjMcWYF IQuTsWOLs/LBMTs+ANumibtw6UkiGVD3dfHJAOPNApjVr+M0P/lVmTeP8w0uVcd2syiaU5jB aht9CYATn+ytFGWZnBEEQFnqcibIaOrmoBLu2b3fKJEd8Jp7NHDSIdrvrMjYynmc6sZKUqH2 I1qOevaa8jUg7wlLJAWGfIqnu85kkqrVOkbNbk4TPub7VOqA6qG5GCNEIv6ZY7HLYd/vAkVY E8Plzq/NwLAuOWxvGrOl7OPuwVeR4hBDfcrNb990MFPpjGgACzAZyjdmYoMu8j3/MAEW4P0z F5+EYJAOZ+z212y1pchNNauehORXgjrNKsZwxwKpPY9qb84E3O9KYpwfATsqOoQ6tTgr+1BR CCwP712H+E9U5HJ0iibN/CDZFVPL1bRerHziuwuQuvE0qWg0+0SChFe9oq0KAwEkVs6ZDMB2 P16MieEEQ6StQRlvy2YBv80L1TMl3T90Bo1UUn6ARXEpcbFE0/aORH/jEXcRteb+vuik5UGY 5TsyLYdPur3TXm7XDBdmmyQVJjnJKYK9AQxj95KlXLVO38lczsFNBFRjzmoBEACyAxbvUEhd GDGNg0JhDdezyTdN8C9BFsdxyTLnSH31NRiyp1QtuxvcqGZjb2trDVuCbIzRrgMZLVgo3upr MIOx1CXEgmn23Zhh0EpdVHM8IKx9Z7V0r+rrpRWFE8/wQZngKYVi49PGoZj50ZEifEJ5qn/H Nsp2+Y+bTUjDdgWMATg9DiFMyv8fvoqgNsNyrrZTnSgoLzdxr89FGHZCoSoAK8gfgFHuO54B lI8QOfPDG9WDPJ66HCodjTlBEr/Cwq6GruxS5i2Y33YVqxvFvDa1tUtl+iJ2SWKS9kCai2DR 3BwVONJEYSDQaven/EHMlY1q8Vln3lGPsS11vSUK3QcNJjmrgYxH5KsVsf6PNRj9mp8Z1kIG qjRx08+nnyStWC0gZH6NrYyS9rpqH3j+hA2WcI7De51L4Rv9pFwzp161mvtc6eC/GxaiUGuH BNAVP0PY0fqvIC68p3rLIAW3f97uv4ce2RSQ7LbsPsimOeCo/5vgS6YQsj83E+AipPr09Caj 0hloj+hFoqiticNpmsxdWKoOsV0PftcQvBCCYuhKbZV9s5hjt9qn8CE86A5g5KqDf83Fxqm/ vXKgHNFHE5zgXGZnrmaf6resQzbvJHO0Fb0CcIohzrpPaL3YepcLDoCCgElGMGQjdCcSQ+Ci FCRl0Bvyj1YZUql+ZkptgGjikQARAQABwsFfBBgBAgAJBQJUY85qAhsMAAoJEGg1lTBwyZKw l4IQAIKHs/9po4spZDFyfDjunimEhVHqlUt7ggR1Hsl/tkvTSze8pI1P6dGp2XW6AnH1iayn yRcoyT0ZJ+Zmm4xAH1zqKjWplzqdb/dO28qk0bPso8+1oPO8oDhLm1+tY+cOvufXkBTm+whm +AyNTjaCRt6aSMnA/QHVGSJ8grrTJCoACVNhnXg/R0g90g8iV8Q+IBZyDkG0tBThaDdw1B2l asInUTeb9EiVfL/Zjdg5VWiF9LL7iS+9hTeVdR09vThQ/DhVbCNxVk+DtyBHsjOKifrVsYep WpRGBIAu3bK8eXtyvrw1igWTNs2wazJ71+0z2jMzbclKAyRHKU9JdN6Hkkgr2nPb561yjcB8 sIq1pFXKyO+nKy6SZYxOvHxCcjk2fkw6UmPU6/j/nQlj2lfOAgNVKuDLothIxzi8pndB8Jju KktE5HJqUUMXePkAYIxEQ0mMc8Po7tuXdejgPMwgP7x65xtfEqI0RuzbUioFltsp1jUaRwQZ MTsCeQDdjpgHsj+P2ZDeEKCbma4m6Ez/YWs4+zDm1X8uZDkZcfQlD9NldbKDJEXLIjYWo1PH hYepSffIWPyvBMBTW2W5FRjJ4vLRrJSUoEfJuPQ3vW9Y73foyo/qFoURHO48AinGPZ7PC7TF vUaNOTjKedrqHkaOcqB185ahG2had0xnFsDPlx5y In-Reply-To: Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit X-Rspamd-Queue-Id: 4DCFC18000E X-Rspam-User: X-Rspamd-Server: rspam05 X-Stat-Signature: i6wp6fwp64r3f6pnjh6474yq8da6febi X-HE-Tag: 1718986318-872774 X-HE-Meta: U2FsdGVkX19/LMH81gUtliELTOhwGChnjGvxfWrqukt6DdcsfZinaw+MxI56KmewVZu7vp9tuxqQ4OePHurc6jzKE2tQTKFDeehjnjoBxX3ZuHOkBJKw4lG/JI72JxtNM+tXT13yNNa9+3h4ZFYG/3cI0ZhUHlJ52tyW2JFBgN9nhhFVlCGrSsD+o3C54qO12uDZFbZtMRV6Lo6C5VEKRrdYInbZuwXuVtNjbwFFRyVnq4jAUklythwh6A0gRrPEwafYqgedPSFpcSjchHuKQOKT/mqMmhwuU4sru9LM4vlk1XRj7u1R6qf+wm+h5H2IsezTr1EYt7sTIbetjGHGmgAh6oKG0so87Pq/Y+tVk8AYEmq2E7kdntD1TY/afAKoQXlUPZucb1nBtkxuXNLk0elBi0d8WsqtgYTNkm75e5r38uIVCd6jnFLGaUj59iXMaodh3VSdMstBVoSjDBAQDQTYv4YKHwKxp4iJUI28qpEou+YRhuI002/ZzG9XNSMYnF3zIe7RhZAUYPiJzJqTUB59gHfdPY2sqVGCeUmMLM61cASO5tKeFouQ/txHwCb+HrPS8Mdyuvo6UP8IU95FUrxMrFQ+nHybQIMDIyYvVQcRO5QJ0Pn+yy4FShYMfreI5OLjp+TXksQwev8cF8iFuyV56sN/J1gPq3Wbnl8BHP/HP8wdVqsHnymeYK1SDQi4uHFlqGIugW1BXBmdS8fOXNOey5G04UZZ4Px2DuvzdOpuzVTgW48TOfsvwelx1ceYJ+s95tGuqNmd3BtOS7qM/42SS1s6hOzv2k3n2K40IRn8HdAwimQ4R9D5seiHZqlWQyf0IKAfoMRF+Tc9DVqW8GJxJlaNrwXr2SW3xxv/CaNTWoJpCYgaOOlFra+AgiXVum3GSptp1GaGn23J5bHqrNP8FXnA5waklp6Ab1OJjTn/pYgU9zJg05CeOWtE+2KOvvYTIEEXSAsFu8Ve3cR uK7u5Gh0 6CNpJswXAVSBp/x6xDotFCHaFbtjP61iORoKZ012KOA8IegF8h+JzY6w4BDzdAQqKO8dCqx7UVrEiUiaQ/kSiASRiUu6FTPc+z5vDYLXS08XqX73lz1+ICBT2jJW+XLUmdXnuTtj31I+O0c4n1p8tkpjH2ReB2SNWvYnYEHYyZ+Jzv88rOHs4vnhCFA== X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: List-Subscribe: List-Unsubscribe: On 6/21/24 08:45, Peter Xu wrote: > On Fri, Jun 21, 2024 at 07:51:26AM -0700, Dave Hansen wrote: ... >> But, still, what if you take a Dirty=1,Write=1 pud and pud_modify() it >> to make it Dirty=1,Write=0? What prevents that from being >> misinterpreted by the hardware as being a valid 1G shadow stack mapping? > > Thanks for pointing that out. I think I was thinking it will only take > effect on VM_SHADOW_STACK first, so it's not? > > I was indeed trying to find more information on shadow stack at that time > but I can't find as much on the pgtable implications, on e.g. whether "D=1 > + W=0" globally will be recognized as shadow stack. At least on SDM March > 2024 version Vol3 Chap4 pgtable entries still don't explain these details, > or maybe I missed it. Please let me know if there's suggestion on what I > can read before I post a v2. It's in the "Determination of Access Rights" section. A linear address is a shadow-stack address if the following are true of the translation of the linear address: (1) the R/W flag (bit 1) is 0 and the dirty flag (bit 6) is 1 in the paging- structure entry that maps the page containing the linear address; and (2) the R/W flag is 1 in every other paging- structure entry controlling the translation of the linear address. > So if it's globally taking effect, indeed we'll need to handle them in PUDs > too. > > Asides, not sure whether it's off-topic to ask here, but... why shadow > stack doesn't reuse an old soft-bit to explicitly mark "this is shadow > stack ptes" when designing the spec? Now it consumed bit 58 anyway for > caching dirty. IIUC we can avoid all these "move back and forth" issue on > dirty bit if so. The design accommodates "other" OSes that are using all the software bits for other things. For Linux, you're right, we just ended up consuming a software bit _anyway_ so we got all the complexity of the goofy permissions *AND* lost a bit in the end. Lose, lose. >>> /* >>> * mprotect needs to preserve PAT and encryption bits when updating >>> * vm_page_prot >>> @@ -1377,10 +1398,25 @@ static inline pmd_t pmdp_establish(struct vm_area_struct *vma, >>> } >>> #endif >>> >>> +static inline pud_t pudp_establish(struct vm_area_struct *vma, >>> + unsigned long address, pud_t *pudp, pud_t pud) >>> +{ >>> + if (IS_ENABLED(CONFIG_SMP)) { >>> + return xchg(pudp, pud); >>> + } else { >>> + pud_t old = *pudp; >>> + WRITE_ONCE(*pudp, pud); >>> + return old; >>> + } >>> +} >> >> Why is there no: >> >> page_table_check_pud_set(vma->vm_mm, pudp, pud); >> >> ? Sure, it doesn't _do_ anything today. But the PMD code has it today. >> So leaving it out creates a divergence that honestly can only serve to >> bite us in the future and will create a head-scratching delta for anyone >> that is comparing PUD and PMD implementations in the future. > > Good question, I really don't remember why I didn't have that, since I > should have referenced the pmd helper. I'll add them and see whether I'll > hit something otherwise. > > Thanks for the review. One big thing I did in this review was make sure that the PMD and PUD helpers were doing the same thing. Would you mind circling back and double-checking the same before you repost this?