From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id 5D399C282E5 for ; Wed, 5 Mar 2025 18:39:51 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id 055AE28001B; Wed, 5 Mar 2025 13:39:50 -0500 (EST) Received: by kanga.kvack.org (Postfix, from userid 40) id F201A28000F; Wed, 5 Mar 2025 13:39:49 -0500 (EST) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id D27D528001B; Wed, 5 Mar 2025 13:39:49 -0500 (EST) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0011.hostedemail.com [216.40.44.11]) by kanga.kvack.org (Postfix) with ESMTP id A24C228000F for ; Wed, 5 Mar 2025 13:39:49 -0500 (EST) Received: from smtpin23.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay01.hostedemail.com (Postfix) with ESMTP id C3BAA1C8AC1 for ; Wed, 5 Mar 2025 18:02:21 +0000 (UTC) X-FDA: 83188266882.23.74F4BB2 Received: from mail-qt1-f179.google.com (mail-qt1-f179.google.com [209.85.160.179]) by imf07.hostedemail.com (Postfix) with ESMTP id 7C63740015 for ; Wed, 5 Mar 2025 18:02:19 +0000 (UTC) Authentication-Results: imf07.hostedemail.com; dkim=pass header.d=google.com header.s=20230601 header.b=1zHYCrkA; dmarc=pass (policy=reject) header.from=google.com; spf=pass (imf07.hostedemail.com: domain of fvdl@google.com designates 209.85.160.179 as permitted sender) smtp.mailfrom=fvdl@google.com ARC-Seal: i=1; s=arc-20220608; d=hostedemail.com; t=1741197739; a=rsa-sha256; cv=none; b=1+TQlzPO6/RXRUOwdmE2IdssvQ3UbPkoXR72wM/3pvlZIwugoyxyjpLL2x9koTH6e4sK0m TeYR9ZyMWEJoeuUH0wdidJFADZ6qTxS02ErYHNzZquluYqMNbeBQoe7og69rY9c/9vflXX vf+u16V3JFO6Vx9ef3jym/oF0ah9zhU= ARC-Authentication-Results: i=1; imf07.hostedemail.com; dkim=pass header.d=google.com header.s=20230601 header.b=1zHYCrkA; dmarc=pass (policy=reject) header.from=google.com; spf=pass (imf07.hostedemail.com: domain of fvdl@google.com designates 209.85.160.179 as permitted sender) smtp.mailfrom=fvdl@google.com ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1741197739; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references:dkim-signature; bh=Ce7kUauPnhKfB4I6YLOsGy09Lzeo9MeLVskX7bGtdeI=; b=o9aygtsMhWpl3hwsxAofQtlKjfVB7SU3K5z2DXtw9NphZ7AGRBI//fRnG+pmo39y55AqRv D5GeHh7+MPTkDXdxQw3APppxEbEteRxiMUn6wu2HJ4F8n1WtkAAr3pz+Jib8e1tOeFEM87 B9I2AMyA14Li9OrTGagniDucB+YdfNk= Received: by mail-qt1-f179.google.com with SMTP id d75a77b69052e-474e1b8c935so11021cf.0 for ; Wed, 05 Mar 2025 10:02:19 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20230601; t=1741197738; x=1741802538; darn=kvack.org; h=content-transfer-encoding:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:from:to:cc:subject:date :message-id:reply-to; bh=Ce7kUauPnhKfB4I6YLOsGy09Lzeo9MeLVskX7bGtdeI=; b=1zHYCrkAfu3gln+U6ZMdx5ibasckbdN/HV/D0PpsCyX3W3wSyNmfxNon3MP+RTHM+w /ej9T9LAZpuyIKUVVnbJ3aXinjiLgjx0Jljt7vPwUDBmeDFq5YdTCmFSbAJxl3RIhpaC Mi53duPevirYsNqiiR7FjDjwZyG5gKL8DutvScuA1hikFRJmEegmAMlgze5yhZTH1n/N krL9EfeaW5O35pQW4vgB4uZtY4LI9iC3B47hMi4VR3bKp4dWqaIp/sibEo9rCeiJ4anG B3B/JQTP5sqtewB/BFRqdtFiW5RdT9TUq/6jM3ILosA1dVdMxJ5pfGmndlSmzCw06xC6 CG0w== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1741197738; x=1741802538; h=content-transfer-encoding:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:x-gm-message-state:from:to:cc :subject:date:message-id:reply-to; bh=Ce7kUauPnhKfB4I6YLOsGy09Lzeo9MeLVskX7bGtdeI=; b=mMgL0CuMZfej4KpDATCX6F7x34loIceOIt06FMeXGWiEBezYME7zO0HtoE03XqbHl0 8v8Rpz6WtgugK1rOLjGx7TtNIRi5zQaVLggvMuUtQdgaUR4cpNfc3KuKvifGvs1j0YZ9 Ta4hoEP+ePjWpY4xQL67vhOy3N088oALElrhZ8tHYBhODqGaTQI2rrWow9GDyBcaBHQO tCELIGjPZUJBhXbih4CCpHiVARcL3oJCMQVkNXjQFH43b09N9RywPUSZvFWB2XqBX2vZ IfzGQL6fzSJZEkQFv6XAEdR2/FeW5DM6Jzl1dP3/HtBwDJye/31y990H/yQ7q8y9Ptx3 uUSg== X-Forwarded-Encrypted: i=1; AJvYcCU0ygp2sAHHAwQtc/b0NhJu6jxz7Z0BAagzncjy+bN12wm/gQoC1z5gX6RWdSPuJFcuisN8ngoAYg==@kvack.org X-Gm-Message-State: AOJu0YyP24bM55tzCb80XetLSCaijPJiY/8lLwIhuqRZEV1c9pQynk7k X/GPWIxeySHDW+3XcuscZ34e1Kt/tejbXvoKJHXOEDIntf/GUYIZmC8O0y3WuxtNk3HpHv+CZXO bDikTxVegcT7WqA1VAEHK43BdQ+44LJi1dLM5 X-Gm-Gg: ASbGncuWtotLnMnyWar3eO/gnTF8ks1MRcRvDDYzK7BeDuvkbJALrx2DdYD8WBCb9FC r7daDsdGxg3o4M0i1AzO0unWysRRwzB7y8rlpMoum/+RhoKyWxnLeH0hKiMUWl2gDq6QdNcnOso 0IvCrhZvZ+rZH7U5CpFnWd8Yw= X-Google-Smtp-Source: AGHT+IFA9r40e48P0pUHRPwxiGF25g58Qn7efDLSHlif2uuHza/9Dpfwcb0S4FrEJ4i63iemawEOHXMC3MFNAakXf4o= X-Received: by 2002:a05:622a:1986:b0:474:e97c:4423 with SMTP id d75a77b69052e-4750b55ac9fmr4853601cf.14.1741197738149; Wed, 05 Mar 2025 10:02:18 -0800 (PST) MIME-Version: 1.0 References: <20250228182928.2645936-3-fvdl@google.com> <202503051327.e87dce82-lkp@intel.com> In-Reply-To: <202503051327.e87dce82-lkp@intel.com> From: Frank van der Linden Date: Wed, 5 Mar 2025 10:02:06 -0800 X-Gm-Features: AQ5f1JpHesB0lMosOdwByQXd4BdAnxPB8Po0EQNzICphKLdaoXnJeFsXCTlZgjU Message-ID: Subject: Re: [PATCH v5 02/27] mm, cma: support multiple contiguous ranges, if requested To: kernel test robot Cc: oe-lkp@lists.linux.dev, lkp@intel.com, Arnd Bergmann , linux-mm@kvack.org, linux-kernel@vger.kernel.org, akpm@linux-foundation.org, muchun.song@linux.dev, yuzhao@google.com, usamaarif642@gmail.com, joao.m.martins@oracle.com, roman.gushchin@linux.dev, ziy@nvidia.com, david@redhat.com Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Rspamd-Server: rspam08 X-Rspamd-Queue-Id: 7C63740015 X-Stat-Signature: o1r3bmtf47r81u7whtwn9qgiarqrh6ow X-Rspam-User: X-HE-Tag: 1741197739-746437 X-HE-Meta: U2FsdGVkX1+QB9fkoAESDAcs5+jjjAoDoqD2mnGm4MbM5onR4NdlGnQl7FD12xRNZ20EcKltXiC9q5b34P9Wt9lHFxfdraIuRw4pGuQODCMOoXLTm5GBgz/Q5iupox8Ap6I5gHM64UCxZzJMYwSgbUw5sJEbJSNRSVwme0S+gHF7HjNRt+ZNU7g4Re04klhASvCxBzA7u8M5EIyPNFgo+IjdgkMYZwrrQgkZ47ydDC3CG9vEmcb7hJU1GAj9hHZbE0tqw9Dp0BSkDZbCCaYj6yyjvk6WgGwYKOygSmLBbnxFBi1RU1Hw9RhzamqYdC2o2CLHL3r929itc9dL4UTxfHV5hUAZNxLzVRN+SYmsZeCgMjaafc07Pzs5LsjDHa/YDRZIgQ7jXTlm6uTEXA1lzzHqZc1shfKtLfU5nvp0H0j13p7pTIQAZrqbPEPI8d8Z/Lljnn7A0aajFX4wSST5kAESk1Vmvx1XJqao5493rUhJMn92q6U5yRiAgS5AYltrrCyvQN0AKmhij0JD5uKk1hFsE7W9R9EuOGBYGYMyakVoWF9G5qZ4JbIzvgqnMwVf9No2kqhu6UALBbi2dYx6a5ZSIDv67P29cVaizKViV+vGUVFtrQ9QhAkz45rfPi/CMSufytIB+MQcpM2MwKboTMQH9TbSa7baZH4EJLSqM/9AIq3LfdDdlX/0ahSUW+lYS2e2XHr8m8shvUZKa+6l5Ylon5Hj5NKDsKNim8tTUTnwSdlXuR8O3bZMUdVJP3QDQwrek5EIEMorc1ApXIpKQEv1+HTIRqJ67enI/HFR4lg43WVF4uF5XIKm5NCozTxK/P2aGTEy8vviK+eq2M7X6DDHasWqAO89hpasuXHgtagLIzP68odmVl65L3JujDl4FEtRrLxBtUWo1gOX2udqsqd2nk0AHREpVLC98wbub9LvXEKTagbJI5b0fUbUrH4gIEf+WdZq0e9uoLky1zH elLMvf0k 7/pRKvYjrrY9KvhrRnoD3Wo/R4+J+PmsD8/8QE9QYE2kr1N4b/u2jcEwpmEnkiedlRnIxzopCr6hpJGm1Q9b6ZmS8psOX8PcN1lnCSLuliPdzVw69J9Y/4qTO5Rcv/AzNor+u71aZa780yWQocdZnS774QnXF7/Q39FlGYRVQ0X+8exHbkgZPo6jDezexIWiTWKf25YT5dpquTjV998tKgoHwiJFYJLEkQt+p30UA0U65L2UMtPo71hsdEArquHxtL1Gs4itgIQc3MXigxsJ01h69+AqWY7e8G23o9b/uqdDX3mxFBtlQv8eqBF46pKTXmTRdLPvZS5D2DKi0SG69RTL9T22LIfoG445N X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: List-Subscribe: List-Unsubscribe: On Tue, Mar 4, 2025 at 10:29=E2=80=AFPM kernel test robot wrote: > > > > Hello, > > kernel test robot noticed a 15.1% improvement of netperf.Throughput_tps o= n: > > > commit: a957f140831b0d42e4fdbe83cf93997ef1b51bda ("[PATCH v5 02/27] mm, c= ma: support multiple contiguous ranges, if requested") > url: https://github.com/intel-lab-lkp/linux/commits/Frank-van-der-Linden/= mm-cma-export-total-and-free-number-of-pages-for-CMA-areas/20250301-023339 > base: https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git 276= f98efb64a2c31c099465ace78d3054c662a0f > patch link: https://lore.kernel.org/all/20250228182928.2645936-3-fvdl@goo= gle.com/ > patch subject: [PATCH v5 02/27] mm, cma: support multiple contiguous rang= es, if requested > > testcase: netperf > config: x86_64-rhel-9.4 > compiler: gcc-12 > test machine: 128 threads 2 sockets Intel(R) Xeon(R) Gold 6338 CPU @ 2.00= GHz (Ice Lake) with 256G memory > parameters: > > ip: ipv4 > runtime: 300s > nr_threads: 200% > cluster: cs-localhost > test: TCP_CRR > cpufreq_governor: performance > > > > > > > Details are as below: > -------------------------------------------------------------------------= -------------------------> > > > The kernel config and materials to reproduce are available at: > https://download.01.org/0day-ci/archive/20250305/202503051327.e87dce82-lk= p@intel.com > > =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D > cluster/compiler/cpufreq_governor/ip/kconfig/nr_threads/rootfs/runtime/tb= ox_group/test/testcase: > cs-localhost/gcc-12/performance/ipv4/x86_64-rhel-9.4/200%/debian-12-x86= _64-20240206.cgz/300s/lkp-icl-2sp2/TCP_CRR/netperf > > commit: > cdc31e6532 ("mm/cma: export total and free number of pages for CMA area= s") > a957f14083 ("mm, cma: support multiple contiguous ranges, if requested"= ) > > cdc31e65328522c6 a957f140831b0d42e4fdbe83cf9 > ---------------- --------------------------- > %stddev %change %stddev > \ | \ > 2.43 +0.5 2.90 =C4=85 4% mpstat.cpu.all.usr% > 4718850 +15.4% 5446771 vmstat.system.cs > 62006 =C4=85 43% -59.6% 25067 =C4=85137% numa-meminfo.node= 0.Mapped > 2884295 =C4=85 41% -59.4% 1171696 =C4=85135% numa-meminfo.node= 0.Unevictable > 28159 =C4=85 2% -17.7% 23164 =C4=85 2% perf-c2c.HITM.loc= al > 5426 =C4=85 3% +28.5% 6973 =C4=85 8% perf-c2c.HITM.rem= ote > 33586 =C4=85 2% -10.3% 30137 =C4=85 3% perf-c2c.HITM.tot= al > 5642375 =C4=85 2% +15.5% 6519596 sched_debug.cpu.nr_swi= tches.avg > 7473763 =C4=85 4% +18.0% 8815709 =C4=85 2% sched_debug.cpu.n= r_switches.max > 4352931 =C4=85 3% +12.7% 4906391 =C4=85 2% sched_debug.cpu.n= r_switches.min > 2485115 =C4=85 6% +31.9% 3277456 =C4=85 11% numa-numastat.nod= e0.local_node > 2526446 =C4=85 6% +32.8% 3356120 =C4=85 11% numa-numastat.nod= e0.numa_hit > 3522582 =C4=85 10% +28.7% 4535065 =C4=85 23% numa-numastat.nod= e1.local_node > 3613797 =C4=85 10% +27.0% 4588978 =C4=85 22% numa-numastat.nod= e1.numa_hit > 40617 +5.4% 42811 =C4=85 5% proc-vmstat.nr_slab_re= claimable > 6144430 =C4=85 4% +29.4% 7948120 =C4=85 16% proc-vmstat.numa_= hit > 6011884 =C4=85 4% +30.0% 7815542 =C4=85 16% proc-vmstat.numa_= local > 26402145 =C4=85 2% +40.6% 37129548 =C4=85 14% proc-vmstat.pgall= oc_normal > 25226079 +42.1% 35834032 =C4=85 13% proc-vmstat.pgfree > 15712 =C4=85 43% -59.6% 6348 =C4=85137% numa-vmstat.node0= .nr_mapped > 721073 =C4=85 41% -59.4% 292924 =C4=85135% numa-vmstat.node0= .nr_unevictable > 721073 =C4=85 41% -59.4% 292924 =C4=85135% numa-vmstat.node0= .nr_zone_unevictable > 2526848 =C4=85 6% +32.8% 3355902 =C4=85 11% numa-vmstat.node0= .numa_hit > 2485517 =C4=85 6% +31.9% 3277238 =C4=85 11% numa-vmstat.node0= .numa_local > 3614259 =C4=85 10% +27.0% 4589442 =C4=85 22% numa-vmstat.node1= .numa_hit > 3523043 =C4=85 10% +28.7% 4535533 =C4=85 23% numa-vmstat.node1= .numa_local > 1711802 +15.1% 1969470 netperf.ThroughputBoth_tota= l_tps > 6686 +15.1% 7693 netperf.ThroughputBoth_tps > 1711802 +15.1% 1969470 netperf.Throughput_total_tp= s > 6686 +15.1% 7693 netperf.Throughput_tps > 4.052e+08 =C4=85 5% +16.7% 4.728e+08 =C4=85 4% netperf.time.invo= luntary_context_switches > 535.88 +18.1% 633.12 netperf.time.user_time > 3.175e+08 =C4=85 3% +13.9% 3.615e+08 =C4=85 3% netperf.time.volu= ntary_context_switches > 5.135e+08 +15.1% 5.908e+08 netperf.workload > 0.07 =C4=85 8% -31.3% 0.05 =C4=85 23% perf-sched.sch_de= lay.avg.ms.__cond_resched.kmem_cache_alloc_node_noprof.kmalloc_reserve.__al= loc_skb.tcp_stream_alloc_skb > 0.46 =C4=85114% -71.4% 0.13 =C4=85 34% perf-sched.sch_de= lay.max.ms.__cond_resched.lock_sock_nested.__inet_stream_connect.inet_strea= m_connect.__sys_connect > 5.70 =C4=85 90% +2752.3% 162.72 =C4=85202% perf-sched.wait_a= nd_delay.avg.ms.devkmsg_read.vfs_read.ksys_read.do_syscall_64 > 33.94 =C4=85 19% +50.3% 50.99 =C4=85 18% perf-sched.wait_a= nd_delay.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm > 30764 =C4=85 22% -32.1% 20881 =C4=85 22% perf-sched.wait_a= nd_delay.count.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm > 7.03 =C4=85 60% +11736.2% 832.16 =C4=85150% perf-sched.wait_a= nd_delay.max.ms.devkmsg_read.vfs_read.ksys_read.do_syscall_64 > 0.14 =C4=85 8% -33.5% 0.09 =C4=85 26% perf-sched.wait_t= ime.avg.ms.__cond_resched.kmem_cache_alloc_node_noprof.kmalloc_reserve.__al= loc_skb.tcp_stream_alloc_skb > 0.11 =C4=85 8% -14.3% 0.10 =C4=85 11% perf-sched.wait_t= ime.avg.ms.__cond_resched.lock_sock_nested.inet_stream_connect.__sys_connec= t.__x64_sys_connect > 33.61 =C4=85 19% +50.4% 50.57 =C4=85 18% perf-sched.wait_t= ime.avg.ms.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm > 0.69 =C4=85109% -59.0% 0.28 =C4=85 27% perf-sched.wait_t= ime.max.ms.__cond_resched.sk_wait_data.tcp_recvmsg_locked.tcp_recvmsg.inet_= recvmsg > 0.76 -39.9% 0.46 =C4=85 12% perf-stat.i.MPKI > 3.959e+10 +14.9% 4.55e+10 perf-stat.i.branch-instruct= ions > 0.92 -0.0 0.90 perf-stat.i.branch-miss-rat= e% > 3.564e+08 +12.7% 4.017e+08 perf-stat.i.branch-misses > 1.561e+08 -32.2% 1.058e+08 =C4=85 12% perf-stat.i.cache-miss= es > 6.91e+08 -33.8% 4.574e+08 =C4=85 6% perf-stat.i.cache-refe= rences > 4760614 +15.5% 5496803 perf-stat.i.context-switche= s > 1.54 -13.5% 1.33 perf-stat.i.cpi > 2048 +49.1% 3054 =C4=85 9% perf-stat.i.cycles-bet= ween-cache-misses > 2.084e+11 +14.9% 2.394e+11 perf-stat.i.instructions > 0.65 +15.3% 0.75 perf-stat.i.ipc > 37.20 +15.5% 42.97 perf-stat.i.metric.K/sec > 0.75 -41.0% 0.44 =C4=85 12% perf-stat.overall.MPKI > 0.90 -0.0 0.88 perf-stat.overall.branch-mi= ss-rate% > 1.54 -13.6% 1.33 perf-stat.overall.cpi > 2060 +48.5% 3060 =C4=85 10% perf-stat.overall.cycl= es-between-cache-misses > 0.65 +15.7% 0.75 perf-stat.overall.ipc > 3.947e+10 +14.9% 4.536e+10 perf-stat.ps.branch-instruc= tions > 3.553e+08 +12.7% 4.005e+08 perf-stat.ps.branch-misses > 1.557e+08 -32.2% 1.055e+08 =C4=85 12% perf-stat.ps.cache-mis= ses > 6.889e+08 -33.8% 4.56e+08 =C4=85 6% perf-stat.ps.cache-ref= erences > 4746041 +15.5% 5479885 perf-stat.ps.context-switch= es > 2.078e+11 +14.9% 2.387e+11 perf-stat.ps.instructions > 6.363e+13 +14.9% 7.312e+13 perf-stat.total.instruction= s > > > > > Disclaimer: > Results have been estimated based on internal Intel analysis and are prov= ided > for informational purposes only. Any difference in system hardware or sof= tware > design or configuration may affect actual performance. > > > -- > 0-DAY CI Kernel Test Service > https://github.com/intel/lkp-tests/wiki > Since there should be no functional change for existing callers of CMA interfaces, I'm flattered by this report, but it's definitely not these commits that cause any change in performance :-) - Frank