From: Muhammad Usama Anjum
Date: Tue, 13 Dec 2022 18:04:04 +0500
To: Cyrill Gorcunov
Cc: Muhammad Usama Anjum, Michał Mirosław, Andrei Vagin, Danylo Mocherniuk,
    Alexander Viro, Andrew Morton, Suren Baghdasaryan, Greg KH,
    Christian Brauner, Peter Xu, Yang Shi, Vlastimil Babka, Zach O'Keefe,
    "Matthew Wilcox (Oracle)", "Gustavo A. R. Silva", Dan Williams,
    kernel@collabora.com, Gabriel Krisman Bertazi, David Hildenbrand,
    Peter Enderborg, "open list : KERNEL SELFTEST FRAMEWORK", Shuah Khan,
    open list, "open list : PROC FILESYSTEM",
    "open list : MEMORY MANAGEMENT", Paul Gofman
Subject: Re: [PATCH v6 2/3] fs/proc/task_mmu: Implement IOCTL to get and/or the clear info about PTEs
References: <20221109102303.851281-1-usama.anjum@collabora.com>
    <20221109102303.851281-3-usama.anjum@collabora.com>

On 12/13/22 1:42 AM, Cyrill Gorcunov wrote:
> On Wed, Nov 09, 2022 at 03:23:02PM +0500, Muhammad Usama Anjum wrote:
> ...
>> +
>> +static long do_pagemap_sd_cmd(struct mm_struct *mm, struct pagemap_scan_arg *arg)
>> +{
>> +	struct mmu_notifier_range range;
>> +	unsigned long __user start, end;
>> +	struct pagemap_scan_private p;
>> +	int ret;
>> +
>> +	start = (unsigned long)untagged_addr(arg->start);
>> +	if ((!IS_ALIGNED(start, PAGE_SIZE)) || (!access_ok((void __user *)start, arg->len)))
>> +		return -EINVAL;
>> +
>> +	if (IS_GET_OP(arg) &&
>> +	    ((arg->vec_len == 0) || (!access_ok((struct page_region *)arg->vec, arg->vec_len))))
>> +		return -ENOMEM;
>> +
>> +	if (IS_SD_OP(arg) && ((arg->required_mask & PAGEMAP_NONSD_OP_MASK) ||
>> +	    (arg->anyof_mask & PAGEMAP_NONSD_OP_MASK)))
>> +		return -EINVAL;
>> +
>> +	end = start + arg->len;
>> +	p.max_pages = arg->max_pages;
>> +	p.found_pages = 0;
>> +	p.flags = arg->flags;
>> +	p.required_mask = arg->required_mask;
>> +	p.anyof_mask = arg->anyof_mask;
>> +	p.excluded_mask = arg->excluded_mask;
>> +	p.return_mask = arg->return_mask;
>> +	p.vec_index = 0;
>> +	p.vec_len = arg->vec_len;
>> +
>> +	if (IS_GET_OP(arg)) {
>> +		p.vec = vzalloc(arg->vec_len * sizeof(struct page_region));
>> +		if (!p.vec)
>> +			return -ENOMEM;
>> +	} else {
>> +		p.vec = NULL;
>> +	}
>
> Hi Muhammad! I'm really sorry for diving in so late (unfortunately too busy to
> step in yet). Anyway, while in general such an interface looks reasonable, here are
> a few points which really bother me: as far as I understand you don't need
> vzalloc here, plain vmalloc should work as well since you copy only filled
> results back to userspace.

Thank you for reviewing. Correct, I'll update it to use vmalloc.

> Next -- there is no restriction on the vec_len parameter,
> is this not a door for DoS from userspace? Say I could start a number of ioctls
> on the same pagemap and try to allocate a very big amount of vec_len in sum, causing
> big pressure on the kernel's memory. Or do I miss something obvious here?

Yes, a large chunk of kernel memory can be allocated here, since vec_len
can be very large. We need to limit this buffer in the current
implementation. Any reasonable limit should work, but I'm not sure what a
reasonable limit would be. Maybe a couple of hundred MB? I'll think about
it.

Alternatively, I could update the implementation to use a smaller
intermediate buffer, as mincore does. But that could complicate the
implementation further, as we already track page ranges instead of keeping
just the flags. I'll see what can be done.

-- 
BR,
Muhammad Usama Anjum
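
For illustration only, the vec_len limit discussed above could look roughly
like the following userspace sketch. It is not code from the patch:
PAGEMAP_SCAN_MAX_VEC_BYTES, its 256 MiB value, the helper name
pagemap_vec_bytes(), the -EINVAL return for an oversized request and the
layout of the stand-in struct page_region are all assumptions made for the
example. Only the -ENOMEM return for vec_len == 0 mirrors the quoted code.

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>
#include <stdint.h>

/* Userspace stand-in for the patch's struct; the field layout is assumed. */
struct page_region {
	uint64_t start;
	uint64_t npages;
	uint64_t flags;
};

/* Hypothetical cap on the intermediate buffer, not taken from the patch. */
#define PAGEMAP_SCAN_MAX_VEC_BYTES ((size_t)256 << 20)	/* 256 MiB */

/*
 * Validate a user-supplied vec_len and compute the buffer size in bytes.
 * Returns 0 and stores the size in *bytes, or a negative errno value.
 */
int pagemap_vec_bytes(uint64_t vec_len, size_t *bytes)
{
	assert(bytes != NULL);

	/* Same result as the quoted code for an empty vector. */
	if (vec_len == 0)
		return -ENOMEM;

	/*
	 * Compare against cap / element size rather than multiplying first,
	 * so that vec_len * sizeof() can never overflow before the check.
	 * The -EINVAL return value is an assumption of this sketch.
	 */
	if (vec_len > PAGEMAP_SCAN_MAX_VEC_BYTES / sizeof(struct page_region))
		return -EINVAL;

	*bytes = (size_t)vec_len * sizeof(struct page_region);
	return 0;
}
```

In the ioctl path, a check of this shape would run before the vmalloc()
call, so a single request could not ask for more than the cap. It does not
by itself bound the total across many concurrent ioctls, which is the other
half of the concern raised above.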