From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id 48B99C77B76 for ; Mon, 17 Apr 2023 14:15:21 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id A63EC6B0071; Mon, 17 Apr 2023 10:15:20 -0400 (EDT) Received: by kanga.kvack.org (Postfix, from userid 40) id A13526B0072; Mon, 17 Apr 2023 10:15:20 -0400 (EDT) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id 8B3D78E0001; Mon, 17 Apr 2023 10:15:20 -0400 (EDT) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0015.hostedemail.com [216.40.44.15]) by kanga.kvack.org (Postfix) with ESMTP id 78DE16B0071 for ; Mon, 17 Apr 2023 10:15:20 -0400 (EDT) Received: from smtpin08.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay03.hostedemail.com (Postfix) with ESMTP id 24D37A04C9 for ; Mon, 17 Apr 2023 14:15:20 +0000 (UTC) X-FDA: 80691080400.08.F4B2511 Received: from NAM04-DM6-obe.outbound.protection.outlook.com (mail-dm6nam04on2054.outbound.protection.outlook.com [40.107.102.54]) by imf22.hostedemail.com (Postfix) with ESMTP id 81415C0035 for ; Mon, 17 Apr 2023 14:15:15 +0000 (UTC) Authentication-Results: imf22.hostedemail.com; dkim=pass header.d=Nvidia.com header.s=selector2 header.b=WrbcrlSR; spf=pass (imf22.hostedemail.com: domain of jgg@nvidia.com designates 40.107.102.54 as permitted sender) smtp.mailfrom=jgg@nvidia.com; dmarc=pass (policy=reject) header.from=nvidia.com; arc=pass ("microsoft.com:s=arcselector9901:i=1") ARC-Seal: i=2; s=arc-20220608; d=hostedemail.com; t=1681740917; a=rsa-sha256; cv=pass; b=LnnFKIUybeDUFRD4dEwrcPFsP9AihJRPS8xUub0WnndPijX78pwlZWUy1fCrxtJF/Rbkn5 iGbpd5YUIDv95U+ZDM9cc9Lm5dtvIQZZBQSX41RanPmNdWDUbFbWKpm5IXBmrMlhHmsV7M FqE2ecvdTwY4le8V/wbKkdToD1qj83c= ARC-Authentication-Results: i=2; imf22.hostedemail.com; dkim=pass header.d=Nvidia.com header.s=selector2 header.b=WrbcrlSR; spf=pass (imf22.hostedemail.com: domain of jgg@nvidia.com designates 40.107.102.54 as permitted sender) smtp.mailfrom=jgg@nvidia.com; dmarc=pass (policy=reject) header.from=nvidia.com; arc=pass ("microsoft.com:s=arcselector9901:i=1") ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1681740917; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type:content-transfer-encoding: in-reply-to:in-reply-to:references:references:dkim-signature; bh=WfrXEzBwWiomCZmP9gORtGauH7ZNalgt/By5l6xbuts=; b=HAKAXZWVfrnzLC+yJj8jEDJ2RQ7wBAwfSTgdSfFw81WFaijKOf9ouadfAO2Z/boWw+pvvw 13VAXr2G5IxDtCVt7vUVhvflILT4QK4vs4zQW8rG1INUK5Hs4UUQfE6AjPIYvIkPyTVrUb x/YWhYfJ+X4E2Qdf87IhiEy390qFDi4= ARC-Seal: i=1; a=rsa-sha256; s=arcselector9901; d=microsoft.com; cv=none; b=Avmozi7HwWqGZgboQpHbahzs7/AsRxEnvE4jMSp0bHP/bPFCGPoPEmml/aOpvlvLr5vPz2yvLpRSIw8shBf3q/oVNsPYvf0697rlAaMtDIkVbw4UcPRR4na7bm54OGmjX9qmjLYS2dCFxreZYnkr1ULdpXLXeSgCXAbnZScaqdfTmCRf984rkNAx46elqct+hED7U9ZYGZF9liqxJhezuB7NENbAQQLBw1bLfmupHApRVIY+P2dri8jC6gnDWyIau5wHSLjqEoEOYkhDxaq82HBh3D1mQCv41qAfjeQx01qtYWXLvE5v/EzcUIsiv/SXjEOEPv60Ldd6a4PqOQF+MA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=microsoft.com; s=arcselector9901; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-AntiSpam-MessageData-ChunkCount:X-MS-Exchange-AntiSpam-MessageData-0:X-MS-Exchange-AntiSpam-MessageData-1; bh=WfrXEzBwWiomCZmP9gORtGauH7ZNalgt/By5l6xbuts=; b=QflxD0AiPXLld4RU7t7/qk5jz0Bujsw/a20jgU4imM5J4W9HdXlPbCl8wn5mEzyK7odQTjOZgHdyzWVjSJBYLW4MpAjSgpj4adyagEOP6iTgHQ3PvEjAy4cZkXsA8AUzPyXfA84m4PF99sBGdVVs2p58MPaAYkJUv6EMyXtyUe7dNz02WHXuillTynvJXn18D1wCKoYQdlBmz39h/oqubMyYtFEppv6JX8ouJiWnz+SLAyFvcyqyHlchAudbLF6pfOkfSrcw6kRbx9uMwwGK6T0zHUHoBMGOtm/dVsKQwZvEVB8gJNFFdFGo2mwWvRkf7HRwTyrpBVfQymEb5SwJmQ== ARC-Authentication-Results: i=1; mx.microsoft.com 1; spf=pass smtp.mailfrom=nvidia.com; dmarc=pass action=none header.from=nvidia.com; dkim=pass header.d=nvidia.com; arc=none DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=Nvidia.com; s=selector2; h=From:Date:Subject:Message-ID:Content-Type:MIME-Version:X-MS-Exchange-SenderADCheck; bh=WfrXEzBwWiomCZmP9gORtGauH7ZNalgt/By5l6xbuts=; b=WrbcrlSRoJGYW8FWdUw77woHsVcNknmvgkvtLFMcdyp/I/Lry4LMzWT3mrzFNeWSsFHMgICAypWJ/aAp6x7VqXrEPnUCt1f1Yw2uqz3h4twS1goJYugMV72yHME0r2uxc9H8g0fmssVCo0eMT94FG/l+V1/UftKP+AZAT3vh4eec2TbiViCdfLH/ZZRB3Bt25ZUOtYSh/mfzVIVYLYZpXJPN0QILKa7N1Ci0rbIQXob9hzHKS6+ExfsdoMyPT8c6MTt8m/S1kk9mqBIoSjcObcw2lcbNxRc4FRw1KQqCxYRMasGpDexll+juadPA0WoAxdSNxrM/P4eKgAcELDeCPw== Received: from LV2PR12MB5869.namprd12.prod.outlook.com (2603:10b6:408:176::16) by DM4PR12MB5213.namprd12.prod.outlook.com (2603:10b6:5:394::16) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.20.6298.45; Mon, 17 Apr 2023 14:15:12 +0000 Received: from LV2PR12MB5869.namprd12.prod.outlook.com ([fe80::6045:ad97:10b7:62a2]) by LV2PR12MB5869.namprd12.prod.outlook.com ([fe80::6045:ad97:10b7:62a2%9]) with mapi id 15.20.6298.030; Mon, 17 Apr 2023 14:15:11 +0000 Date: Mon, 17 Apr 2023 11:15:10 -0300 From: Jason Gunthorpe To: Lorenzo Stoakes Cc: linux-mm@kvack.org, linux-kernel@vger.kernel.org, Andrew Morton , Matthew Wilcox , David Hildenbrand , Jens Axboe , Pavel Begunkov , io-uring@vger.kernel.org Subject: Re: [PATCH 5/7] io_uring: rsrc: use FOLL_SAME_FILE on pin_user_pages() Message-ID: References: <17357dec04b32593b71e4fdf3c30a346020acf98.1681508038.git.lstoakes@gmail.com> <34959b70-6270-46cf-94c5-d6da12b0c62d@lucifer.local> Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <34959b70-6270-46cf-94c5-d6da12b0c62d@lucifer.local> X-ClientProxiedBy: YT4PR01CA0004.CANPRD01.PROD.OUTLOOK.COM (2603:10b6:b01:d1::9) To LV2PR12MB5869.namprd12.prod.outlook.com (2603:10b6:408:176::16) MIME-Version: 1.0 X-MS-PublicTrafficType: Email X-MS-TrafficTypeDiagnostic: LV2PR12MB5869:EE_|DM4PR12MB5213:EE_ X-MS-Office365-Filtering-Correlation-Id: 7f2dd7b8-baef-4a8f-7150-08db3f4e2835 X-MS-Exchange-SenderADCheck: 1 X-MS-Exchange-AntiSpam-Relay: 0 X-Microsoft-Antispam: BCL:0; X-Microsoft-Antispam-Message-Info: jwIFZQKIo+9fSWhFf+EcUYaJa3ul4hs85jCCiCSxeyJsrZupdh+lToWFzM6ZqsgQ9Z/Ko1j5bXXavJyPljBmchkD/dpAivaSqxyhvUfzjmhzgv7KqsJDrlfm2mKU4WrI5VXEulYgIRDNiheaXPu2m3QdY5nyaO+X3UJRr0WJVlDcsOrLuAz5yH8dD846hjOkel0lm0QJrZ+mCXkbHg1bNhfLnsKxvfZBXmOOxcFxUlKi7/tteWnVhjplz/9VsQgHcj34PQa/yGvDomJIgyJAzKJgbADmEHeuFGx2xKdIE12cSwN5qhASJ7QQl1yf6B2bgT4BUh5sV4KkQYOZsGczeGGpgEhXbLBO8cWta5LmH5UhZDVno8Ak+Or5EZjm80kuYde93fLuFVQqyIqBD1TuV95Pz9ww9mM+1UlWzFBEFzYb/vEPze0/AZ4r75xCUe+vF2/k/YioolUAs///2pbzqLnk11NrL09tOUdncqTmR5qmoCBVZOxIU9i44Zb2Rfzwof5T7xn/yB4dVk10YO8oIXY1OIiod06mjVavtdEOT+eB5ivdx2R/OvVntIBquHrC X-Forefront-Antispam-Report: CIP:255.255.255.255;CTRY:;LANG:en;SCL:1;SRV:;IPV:NLI;SFV:NSPM;H:LV2PR12MB5869.namprd12.prod.outlook.com;PTR:;CAT:NONE;SFS:(13230028)(4636009)(39860400002)(346002)(396003)(366004)(136003)(376002)(451199021)(36756003)(2906002)(5660300002)(8936002)(8676002)(41300700001)(86362001)(38100700002)(478600001)(54906003)(2616005)(26005)(6506007)(6512007)(186003)(6486002)(6916009)(4326008)(66476007)(66556008)(66946007)(316002)(83380400001);DIR:OUT;SFP:1101; X-MS-Exchange-AntiSpam-MessageData-ChunkCount: 1 X-MS-Exchange-AntiSpam-MessageData-0: =?us-ascii?Q?mSPY23IUhBa3uivgGO0EkNfMl+Izz8sLcy7zAdzokUK3PeKYH7Lp2OoKM2rL?= =?us-ascii?Q?qqtecas0VldkNttDG//PCy/Djw4VDFhkYm/kGOu9kq30wLmkM0YqS4f69Bb9?= =?us-ascii?Q?rgrQGiNZfKHf4RNYy8RjBU3FpHtwSy/GL0PvoQI3MtE9SlC1fSe+AF66lnZk?= =?us-ascii?Q?0mgpf57P2ZDfdlWO52G4ENasnmmfLjzFE5wpFNURFHGndwXDHHyczsY8ZuiP?= =?us-ascii?Q?k5djl+VXkhUe7qRsn4wUcJGPplzlWPao5f4Mes256OBJQH0eGPU752GOMJf3?= =?us-ascii?Q?5EsH0wEoM0hsCxIZXtN+boyA9fecFGsr12F6JKjlXzdD1h8I3s0Dgvz5zH5a?= =?us-ascii?Q?UID6qw/AdkyZpn5SFgN7+83m33RmfBJBdq4wnZx1pmXKwbUz8UhCX5FuvY/s?= =?us-ascii?Q?3rIM5sj5qnNsZlDwhyL1L/4wAkrHFDCzK/loAy3+dDhBCyqo2CtrWTdgMyP1?= =?us-ascii?Q?/Sg3iLC5Dprm4L9q+SPodAhlHxPBZsX7nESmtBkfzLJrVQ/XOcBw/v7c9gnu?= =?us-ascii?Q?Py6dbbBaFSDYT1z+yFhCtnp3Im0p4cCBFcIgnRf884Cpi7iIXcYmTx7sSIQQ?= =?us-ascii?Q?4YDZbWsy/9H3z2N58CrFk3jO8oFNFdsBTk91vNBlsqSkM7j+29AZnax8YVHb?= =?us-ascii?Q?/8HF8kXfUUDYQCmYfCZW1Siye3ONsX8wwkjcAtH5UsfxgbxfsuJsd4tnobnp?= =?us-ascii?Q?KIYwKpn0POLxOV/DMBbpRzthGMPc+o/xoaR8wE7j68sbGcEXB4STJVSfD4oa?= =?us-ascii?Q?eOWgseuEZmpuBoOslf/49jDdNb4b/fclxgMrsyIn+vgfU0gg5I1DnRkZXIv/?= =?us-ascii?Q?9GZvFN8iGAmQhLRVh7jK1p33yyFkh4tq/YPxINvv0EA0BqJxUDxHy7UMxisa?= =?us-ascii?Q?8BSea6vl02fyMWo2+bmcL3Y5xi6tUPI2o/KXS9qfSSHEtjY2w8BQL5CYS1AB?= =?us-ascii?Q?tqxs7imJJ/x2zx6pEE/7OE+yGkTs7ZaXZ4GwDuYYXhmlqQ7GBjqSribtSQNk?= =?us-ascii?Q?aqPgOJ2EE536JFhXR+mSyy4f1+MCRnsvlsPGyYC1akdWQ7ESSdSVCk4hrNI+?= =?us-ascii?Q?e0JIFMvOEsNaUXK7esyqV/2hvUI1p4F1tZtWx4+sItdDgiujmkNoX0mn1i8g?= =?us-ascii?Q?pyc2jPdYVXEJknoBDVDnicxFfzNmha+ASE6j1f7qc/gqKHwFHPbuFEVqn5mZ?= =?us-ascii?Q?1hBFVEAd824Q+8oYV6tDCrwSw++l4GCypbYm6KUt1A8pAP2deb8cYn6jkZi5?= =?us-ascii?Q?n+1JqgLXWbyPNPhsd9odMJdFSv9ZstbhTzusQtoGESS0T1pehapIs1x12ZBS?= =?us-ascii?Q?+e7sEdZF7DDXnfDUXS4W+nTXXZqr+6q8dcxC/5oKyKeiqj/bUmgmEdAkeWtk?= =?us-ascii?Q?bPZ/opV9SBLA4+UScyK/1bkOKaVn9eIl7GZwjUsx7sx7E6mxYEQ5Y4jx2rzd?= =?us-ascii?Q?bTozhJoy8ZOzY5R9+INiP4tVu1vwFj5dOdJxhIYu1ke1FjYMMQElQEBl2Wkj?= =?us-ascii?Q?SpBqep5mdTK050BxpT8xi0Ii/DsNH88CglPNrgYbNkrV1ijXexb0K2yZduat?= =?us-ascii?Q?hqTgpnsAWKmlbCn8k9l2BkM8XF/+ozbfKWjzR3km?= X-OriginatorOrg: Nvidia.com X-MS-Exchange-CrossTenant-Network-Message-Id: 7f2dd7b8-baef-4a8f-7150-08db3f4e2835 X-MS-Exchange-CrossTenant-AuthSource: LV2PR12MB5869.namprd12.prod.outlook.com X-MS-Exchange-CrossTenant-AuthAs: Internal X-MS-Exchange-CrossTenant-OriginalArrivalTime: 17 Apr 2023 14:15:11.9383 (UTC) X-MS-Exchange-CrossTenant-FromEntityHeader: Hosted X-MS-Exchange-CrossTenant-Id: 43083d15-7273-40c1-b7db-39efd9ccc17a X-MS-Exchange-CrossTenant-MailboxType: HOSTED X-MS-Exchange-CrossTenant-UserPrincipalName: KeroLZLKlk1i4/TAMlgmn1cHiRE/9qzkk+kHfD1rig4Yn9IgDPlb7gUapWyg5Oxx X-MS-Exchange-Transport-CrossTenantHeadersStamped: DM4PR12MB5213 X-Rspam-User: X-Rspamd-Queue-Id: 81415C0035 X-Rspamd-Server: rspam01 X-Stat-Signature: riyq4y8ib71c8x849m7a5ymuhmf7wsnq X-HE-Tag: 1681740915-77895 X-HE-Meta: U2FsdGVkX1+JkdJQOoSV95dxCkaTrT+5ievRlIMjpUpCCtUG1zjONVMNum+hNUz/awUAznfyJx7S+0K8iJbwNeR3Leh/HR9T5kLnz+nP19TttYb3re01xxvO8SQhdOTFpqVIA3PwKrmQUGHSBXKKnYGNK240moC5XnetduqoTdA/nYjFnSiNEDYTg6gJEMSubSLuwni5QzQFCBofH18MZPDoyIcowdadt0OJfDDN9eeuDwM3lJPLfoyL0XxmEVLNpZxSFkz3ea0EiJiRpFS5SQBVfyWr6txzusHaf66otCGP3vWUjWNFe9jmDTsryqTm4nIcpFeX8p9pSQbGMMyJ6Hn5xoHHeBmzVfk7soz5KGzPQXqBAyhLhNEouhE8qrDucvPNELK8L39baKCcme0s5Ps3qYYAAMB9ulG/Z7DqmSV3Itij0TpLNfApxxKYk6Pp4yivBSSbh8MalRgdtKH4nEUenycTFrrpM3O2gLAiexxQ7hkLJju4OnaQ+aTKXbwgMN8vFdY3g3cPUaOWjHWpgrj8oE/3/aUV2PtDETYTmKe9JQwIkv27sI16JPJNT6jjWGtoJJdz85RUIT99l7bGKqqbxJgKawE1Z1L+k2XMv5nwVvHUefSbchJbCMRtPHY84elsRV2qRx5gDhDoyfEZcon9VkRGzF+6MPvhjKRNheCpINyuFp15/lZ/nXGXLESOKIPxLg4VWlB6ap2PFoUcNBzMTV1Qat/2pQUD8ccM0Uw4KEkVTxdrsoRa6t5UnyyfvIfCHMJ97NtVVjOE7XsjVxVhcyso2R+TyAULLHuhOp3VsdTBKc21Oil20dG8heCvOxx0m3MXu8e7sQaGnSVL/ut6/bmeJLv8Kf7cgB98OVd1hVV8wjuQGMpyFR9MMxYqQ5ba+acpSISsZ9ApBypVOV9hFQwqeWzAF0UuQM4QOY76qg6bayFoU/a2vEaFnH4W71TC4mcVJKNDWxX6MWZ K50Zh2To uko+K6nXG88LL47Y1qVefWduvvV4tAuetIZ9svkkPxoKkv7hVVyFSpuEsoc2sRk+trnsadmWIuHcT4/YG3ge65w8U6tin9WHKAgEsC123f70mlzPSTrFk48+Rd5YyPE4atu441zporG1X6IQ4QEmoFwkqHjvzq686TWFsp+ZhnmbyS0SuCbZzLbpyewqWKs6iyL27niSYty5UcCxHOCB17Ez3eEYG4T51GZ0rupUhnGIVm95RTXwUCex4BknMIWa43y9hSjsEtybdWLE2n/2GTi77ucOqE8yzVg9tMNlM5y77tt6EO6yhsT7O3DWHcLOTK8Y9gEcSrit+EFaF8s8+/STQCZQssTX0LVZr X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: On Mon, Apr 17, 2023 at 03:00:16PM +0100, Lorenzo Stoakes wrote: > On Mon, Apr 17, 2023 at 10:26:09AM -0300, Jason Gunthorpe wrote: > > On Mon, Apr 17, 2023 at 02:19:16PM +0100, Lorenzo Stoakes wrote: > > > > > > I'd rather see something like FOLL_ALLOW_BROKEN_FILE_MAPPINGS than > > > > io_uring open coding this kind of stuff. > > > > > > > > > > How would the semantics of this work? What is broken? It is a little > > > frustrating that we have FOLL_ANON but hugetlb as an outlying case, adding > > > FOLL_ANON_OR_HUGETLB was another consideration... > > > > It says "historically this user has accepted file backed pages and we > > we think there may actually be users doing that, so don't break the > > uABI" > > Having written a bunch here I suddenly realised that you probably mean for > this flag to NOT be applied to the io_uring code and thus have it enforce > the 'anonymous or hugetlb' check by default? Yes > So you mean to disallow file-backed page pinning as a whole unless this > flag is specified? Yes > For FOLL_GET I can see that access to the underlying > data is dangerous as the memory may get reclaimed or migrated, but surely > DMA-pinned memory (as is the case here) is safe? No, it is all broken, read-only access is safe. We are trying to get a point where pin access will interact properly with the filesystem, but it isn't done yet. > Or is this a product more so of some kernel process accessing file-backed > pages for a file system which expects write-notify semantics and doesn't > get them in this case, which could indeed be horribly broken. Yes, broadly > I am definitely in favour of cutting things down if possible, and very much > prefer the use of uaccess if we are able to do so rather than GUP. > > I do feel that GUP should be focused purely on pinning memory rather than > manipulating it (whether read or write) so I agree with this sentiment. Yes, someone needs to be brave enough to go and try to adjust these old places :) I see in the git history this was added to solve CVE-2018-1120 - eg FUSE can hold off fault-in indefinitely. So the flag is really badly misnamed - it is "FOLL_DONT_BLOCK_ON_USERSPACE" and anon memory is a simple, but overly narrow, way to get that property. If it is changed to use kthread_use_mm() it needs a VMA based check for the same idea. Jason