From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id 7EE59C369AB for ; Thu, 24 Apr 2025 07:06:12 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id 217506B0006; Thu, 24 Apr 2025 03:06:11 -0400 (EDT) Received: by kanga.kvack.org (Postfix, from userid 40) id 1C6F76B0007; Thu, 24 Apr 2025 03:06:11 -0400 (EDT) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id 08DEA6B0008; Thu, 24 Apr 2025 03:06:11 -0400 (EDT) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0013.hostedemail.com [216.40.44.13]) by kanga.kvack.org (Postfix) with ESMTP id DF4876B0006 for ; Thu, 24 Apr 2025 03:06:10 -0400 (EDT) Received: from smtpin22.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay01.hostedemail.com (Postfix) with ESMTP id F35F21C8415 for ; Thu, 24 Apr 2025 07:06:10 +0000 (UTC) X-FDA: 83368053300.22.C886E4A Received: from mx0a-001b2d01.pphosted.com (mx0a-001b2d01.pphosted.com [148.163.156.1]) by imf26.hostedemail.com (Postfix) with ESMTP id 82E6F14000B for ; Thu, 24 Apr 2025 07:06:08 +0000 (UTC) Authentication-Results: imf26.hostedemail.com; dkim=pass header.d=ibm.com header.s=pp1 header.b=gud8syDm; dmarc=pass (policy=none) header.from=ibm.com; spf=pass (imf26.hostedemail.com: domain of venkat88@linux.ibm.com designates 148.163.156.1 as permitted sender) smtp.mailfrom=venkat88@linux.ibm.com ARC-Seal: i=1; s=arc-20220608; d=hostedemail.com; t=1745478368; a=rsa-sha256; cv=none; b=toMuMjr2o/bZVtqLtQpDkrEWXLiEb1WOT3u2MUyi37lOKtTxHRIVBnfoSY8ufOdsJXrUxV +UMYtEv6MiV+/0gCV7Vzqjkg+CY4Dwjkh/3GojD/wWg99QmspluGI5YhtCJiIZLkfpotbQ vidJtPF6rvuFa+dasV3fpjaMMIurjGs= ARC-Authentication-Results: i=1; imf26.hostedemail.com; dkim=pass header.d=ibm.com header.s=pp1 header.b=gud8syDm; dmarc=pass (policy=none) header.from=ibm.com; spf=pass (imf26.hostedemail.com: domain of venkat88@linux.ibm.com designates 148.163.156.1 as permitted sender) smtp.mailfrom=venkat88@linux.ibm.com ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1745478368; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references:dkim-signature; bh=eiiUtSCuRGYYjkG0GYgwAQhmsCQCr7/NSudYGV+4b7A=; b=dtvgcuK9AOXzp8fR9ScPMQyTBGQrijr1Xw3jCUAZBU9Skf9P+6YDjMSATuKl3786MC8T+4 kySOMHXjVHNsSWVw+DS0v2H8p+XxGWo4x+bI9Xn3YqNw7m0aKPTGbrgLlJ0h9hNBG4eRVh p7Z9OH8XfLPCX8xO9p4dAYj1HR9Sbg0= Received: from pps.filterd (m0353729.ppops.net [127.0.0.1]) by mx0a-001b2d01.pphosted.com (8.18.1.2/8.18.1.2) with ESMTP id 53NL3Mn1010659; Thu, 24 Apr 2025 07:05:54 GMT DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=ibm.com; h=cc :content-transfer-encoding:content-type:date:from:in-reply-to :message-id:mime-version:references:subject:to; s=pp1; bh=eiiUtS CuRGYYjkG0GYgwAQhmsCQCr7/NSudYGV+4b7A=; b=gud8syDmCqT2Q0rMzkSOF8 RNsITnbvEDBJFwbLZRRGTmye+QCbWoV+ST9JnHap0pgOv9RzZNdzOGsBWvSzsq1k kPBtq3SrYDj08wThMi84JM4Rm5q2pvTZZTpqvCi8b0eWpNYBDNaw7u0fjSH+IT5Z iWIbubCX+nXVzkz4E1kkTtMeY1J0DqRY74gmSttQe5GCnhqohtqI8xtiy61Q9Wiv k8MfBY6fMF3Z/ZF+r5nxaGOJMUalrvoHfc0DlzdgBnnBRh2p/L7GGGwWk6oN05QW 62NQX6fcH7ogj2pK+bmSadjrpk+D5Zmc0uzmNE9bQH+saBROBab3/jxbQ8/5jQwA == Received: from ppma21.wdc07v.mail.ibm.com (5b.69.3da9.ip4.static.sl-reverse.com [169.61.105.91]) by mx0a-001b2d01.pphosted.com (PPS) with ESMTPS id 4677sm231f-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=NOT); Thu, 24 Apr 2025 07:05:54 +0000 (GMT) Received: from pps.filterd (ppma21.wdc07v.mail.ibm.com [127.0.0.1]) by ppma21.wdc07v.mail.ibm.com (8.18.1.2/8.18.1.2) with ESMTP id 53O2l7PK028447; Thu, 24 Apr 2025 07:05:52 GMT Received: from smtprelay02.dal12v.mail.ibm.com ([172.16.1.4]) by ppma21.wdc07v.mail.ibm.com (PPS) with ESMTPS id 466jfvpyd7-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=NOT); Thu, 24 Apr 2025 07:05:52 +0000 Received: from smtpav05.dal12v.mail.ibm.com (smtpav05.dal12v.mail.ibm.com [10.241.53.104]) by smtprelay02.dal12v.mail.ibm.com (8.14.9/8.14.9/NCO v10.0) with ESMTP id 53O75paN14484202 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-GCM-SHA384 bits=256 verify=OK); Thu, 24 Apr 2025 07:05:52 GMT Received: from smtpav05.dal12v.mail.ibm.com (unknown [127.0.0.1]) by IMSVA (Postfix) with ESMTP id DDE4358068; Thu, 24 Apr 2025 07:05:51 +0000 (GMT) Received: from smtpav05.dal12v.mail.ibm.com (unknown [127.0.0.1]) by IMSVA (Postfix) with ESMTP id 8614758069; Thu, 24 Apr 2025 07:05:42 +0000 (GMT) Received: from [9.61.250.107] (unknown [9.61.250.107]) by smtpav05.dal12v.mail.ibm.com (Postfix) with ESMTP; Thu, 24 Apr 2025 07:05:40 +0000 (GMT) Message-ID: <57892439-7683-43b7-9b03-4553737273b7@linux.ibm.com> Date: Thu, 24 Apr 2025 12:35:36 +0530 MIME-Version: 1.0 User-Agent: Mozilla Thunderbird Subject: Re: [PATCH v5 0/2] sched/numa: Skip VMA scanning on memory pinned to one NUMA node via cpuset.mems Content-Language: en-GB To: Libo Chen , akpm@linux-foundation.org, rostedt@goodmis.org, peterz@infradead.org, mgorman@suse.de, mingo@redhat.com, juri.lelli@redhat.com, vincent.guittot@linaro.org, tj@kernel.org, llong@redhat.com Cc: sraithal@amd.com, kprateek.nayak@amd.com, raghavendra.kt@amd.com, yu.c.chen@intel.com, tim.c.chen@intel.com, vineethr@linux.ibm.com, chris.hyser@oracle.com, daniel.m.jordan@oracle.com, lorenzo.stoakes@oracle.com, mkoutny@suse.com, linux-mm@kvack.org, cgroups@vger.kernel.org, linux-kernel@vger.kernel.org References: <20250424024523.2298272-1-libo.chen@oracle.com> From: Venkat Rao Bagalkote In-Reply-To: <20250424024523.2298272-1-libo.chen@oracle.com> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-TM-AS-GCONF: 00 X-Proofpoint-Spam-Details-Enc: AW1haW4tMjUwNDI0MDA0MiBTYWx0ZWRfX7aSpWUfcpGQK Grsmq2TPz74F6uGuWD5Hizcvf/dm1gpr4Nx7WIfeHiUIgBWnwsHHxdnW64s0SoJRHIfkv2oGZCM BgeArVA3iv7k+lPrBae49v+2SRjI3676aMUbb+Iu676gp1gowonVI/avl0WNQv0e7CjoMoKiQSa RuViIwX/Nk90kwQWuJfkMFPwhjvc/lyRrTwR+NpDPQq4DDd5c0gj8QFdSsnhYvcXw5aM4GIJXbS yoa/iLvM6a7f7Yrj7W5k79jmgqOv3wQ02p/PwI9tLE5abAwpXcROqROzJ7IMQwdoqxi1hKt1+O+ 4dUy1Pn0ATf23A9+g2hwAB9242yNktsiL4HsLAxcUa4myuMIcObnz/1YtMFKLGpCJ28VCH7t0r9 +n2OtfLTwD2bQ38xjCAhL/ATyW9Mbbvz//+6vSv0l01ZPm2fbx5QRv2ZS1nzAcyXDFq7Dzuh X-Proofpoint-GUID: s25T2sYloqrekRMlvi3K0t_Unia1aQfj X-Proofpoint-ORIG-GUID: s25T2sYloqrekRMlvi3K0t_Unia1aQfj X-Authority-Analysis: v=2.4 cv=CcMI5Krl c=1 sm=1 tr=0 ts=6809e2d2 cx=c_pps a=GFwsV6G8L6GxiO2Y/PsHdQ==:117 a=GFwsV6G8L6GxiO2Y/PsHdQ==:17 a=IkcTkHD0fZMA:10 a=XR8D0OoHHMoA:10 a=VwQbUJbxAAAA:8 a=rOUgymgbAAAA:8 a=yPCof4ZbAAAA:8 a=QyXUC8HyAAAA:8 a=zd2uoN0lAAAA:8 a=EkCr6PPAZoFWUTBO02gA:9 a=QEXdDO2ut3YA:10 a=MP9ZtiD8KjrkvI0BhSjB:22 X-Proofpoint-Virus-Version: vendor=baseguard engine=ICAP:2.0.293,Aquarius:18.0.1099,Hydra:6.0.680,FMLib:17.12.80.40 definitions=2025-04-24_02,2025-04-22_01,2025-02-21_01 X-Proofpoint-Spam-Details: rule=outbound_notspam policy=outbound score=0 phishscore=0 malwarescore=0 clxscore=1011 mlxlogscore=999 adultscore=0 priorityscore=1501 mlxscore=0 suspectscore=0 bulkscore=0 spamscore=0 lowpriorityscore=0 impostorscore=0 classifier=spam authscore=0 authtc=n/a authcc= route=outbound adjust=0 reason=mlx scancount=1 engine=8.19.0-2504070000 definitions=main-2504240042 X-Rspam-User: X-Rspamd-Server: rspam02 X-Rspamd-Queue-Id: 82E6F14000B X-Stat-Signature: 9kwa7ix6b4ifdfne7yt1a3rktwp8i5hi X-HE-Tag: 1745478368-295536 X-HE-Meta: U2FsdGVkX1+VTvU4ugLWQHxt8pIEjDFldo0dmlwWREhMV280z3xfXemr8YJtfbIEcmj9BHlGnkwmo/WmPpK1ydEqAqX79/04xflRAvZ2uMwmfImXGq1iafRj2XWTafk81EVNI1ja7IQYvIJ19aE3FbcAREhMe/Q08Z4WQGyXF92Mw2ZIfKLgQXWyIk1eC7Bbw0FN41ItsaXdQ98prdisqzE7caInzIkCu6qkjxsWiGkg/S8O0o8h+fJZf0djq2kxTbLEV57Gp27Y+LcEf3kHIJzskrVhvhbuqKNmEUOy2DLEHy7gBJZLbWIpDjIQq01kRN3gcceLRQBL7XPCgvLurFqAJGZAi0vknU+/iA0yNjdc/xCc+t66rJEDtruwYtjct1nbSFdr6KNto1TouTSdq+kDuOjAkxv0YTwkm1lXrNkVoHX9Bj/JkqX2ui1LxGb0abstV5m+j6EAxNkpVazXd7w9l0iIhn6GDVUNGOB00XXgtRCRhQ6T32gq4S3PR+blkwrODddlT0dE/fvnLWkaVaZWh6OVOUMr0BTOWUfP3DzgpWzkQvorhhiTxLT9fZC9HZ9WYuXPkOsXbBxeVj/1cY7yz+JAYDxwBuuMf20QUnMcFXNXOfN2gby6gtcQJfC47BsZFa/QifJ1C4wl1ngfuAnQx5qMYZu7diKwU+6Rb7gW0gFE9qnKxJI1DYlFMC0KXvcD6EUZ+/FIlCrj22eXwki6zPT11NijfNyQwmosqrN8zqCWotHAuqnHome9jY3Pp3NZDkMdWS0qf0tXTA7spgc//TpQ3DLJT35ny0nwzbj9mXqd7sAviAemNS5WdLyiishA183tz5ebE/EDntkWx8KfSgvDB16ynoF0//1LkPk313aebEMBHqjC9yIYuxFW8rUAhJo7jgE9m5m76clvAy0Cgwb6BV38+zAZfUuEZPsjsTWyFtWIcJA92hfxOLe/5EbONZpfmlLa0Z/FI/b +66hOYoE 0m208HK/2dAAMyiweRg2vZY2ebttPnmOZtmVtyvLtKQoW24PrSnJwqKrYz46f9glSpDJZH0PTlROqlWTG3KZyHtwDfPrU+wRbXFdJJDuBr7HZP0g1NhgLws33xhPS5zG7gPhEKByLJq7347lfVtk0ktSjk7zt/qc53bmZq1mDmmOk9HmF6bt95UwJJEXTNwhBSYN+XL5KWVqM8VRpqg8fj+P6JINrZIYSV7eqOdjNQJPtsRksxEu6mPL7R224QbTu/SaWdAg5dLSNG95Uxw0ROeOM+269HGInWl9UkhK7s+ZMkB0+r/uLhgqtwQ13VA89RRKyJ5TzwKjA2NAmrw3UG4qmsbskgHw5LFQlfr9V21cPdgOTxUWgU2EZxZuIwRJkji5X X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: List-Subscribe: List-Unsubscribe: On 24/04/25 8:15 am, Libo Chen wrote: > v1->v2: > 1. add perf improvment numbers in commit log. Yet to find perf diff on > will-it-scale, so not included here. Plan to run more workloads. > 2. add tracepoint. > 3. To peterz's comment, this will make it impossible to attract tasks to > those memory just like other VMA skippings. This is the current > implementation, I think we can improve that in the future, but at the > moment it's probabaly better to keep it consistent. > > v2->v3: > 1. add enable_cpuset() based on Mel's suggestion but again I think it's > redundant. > 2. print out nodemask with %*p.. format in the tracepoint. > > v3->v4: > 1. fix an unsafe dereference of a pointer to content not on ring buffer, > namely mem_allowed_ptr in the tracepoint. > > v4->v5: > 1. add BUILD_BUG_ON() in TP_fast_assign() to guard against future > changes (particularly in size) in nodemask_t. > > Libo Chen (2): > sched/numa: Skip VMA scanning on memory pinned to one NUMA node via > cpuset.mems > sched/numa: Add tracepoint that tracks the skipping of numa balancing > due to cpuset memory pinning > > include/trace/events/sched.h | 33 +++++++++++++++++++++++++++++++++ > kernel/sched/fair.c | 9 +++++++++ > 2 files changed, 42 insertions(+) > Hello Libo, For some reason I am not able to apply this patch. I am trying to test the boot warning[1]. I am trying to apply on top of next-20250423. Below is the error. Am I missing anything? [1]: https://lore.kernel.org/all/20250422205740.02c4893a@canb.auug.org.au/ Error: git am -i v5_20250423_libo_chen_sched_numa_skip_vma_scanning_on_memory_pinned_to_one_numa_node_via_cpuset_mems.mbx Commit Body is: -------------------------- sched/numa: Skip VMA scanning on memory pinned to one NUMA node via cpuset.mems When the memory of the current task is pinned to one NUMA node by cgroup, there is no point in continuing the rest of VMA scanning and hinting page faults as they will just be overhead. With this change, there will be no more unnecessary PTE updates or page faults in this scenario. We have seen up to a 6x improvement on a typical java workload running on VMs with memory and CPU pinned to one NUMA node via cpuset in a two-socket AARCH64 system. With the same pinning, on a 18-cores-per-socket Intel platform, we have seen 20% improvment in a microbench that creates a 30-vCPU selftest KVM guest with 4GB memory, where each vCPU reads 4KB pages in a fixed number of loops. Signed-off-by: Libo Chen Tested-by: Chen Yu Tested-by: K Prateek Nayak -------------------------- Apply? [y]es/[n]o/[e]dit/[v]iew patch/[a]ccept all: a Applying: sched/numa: Skip VMA scanning on memory pinned to one NUMA node via cpuset.mems error: patch failed: kernel/sched/fair.c:3329 error: kernel/sched/fair.c: patch does not apply Patch failed at 0001 sched/numa: Skip VMA scanning on memory pinned to one NUMA node via cpuset.mems Regards, Venkat.