From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id 144C7C54E49 for ; Mon, 4 Mar 2024 09:07:09 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id 9D59F6B009E; Mon, 4 Mar 2024 04:07:08 -0500 (EST) Received: by kanga.kvack.org (Postfix, from userid 40) id 95E6D6B00A0; Mon, 4 Mar 2024 04:07:08 -0500 (EST) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id 1F9F66B00A1; Mon, 4 Mar 2024 04:07:06 -0500 (EST) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0011.hostedemail.com [216.40.44.11]) by kanga.kvack.org (Postfix) with ESMTP id AAF7B6B009E for ; Mon, 4 Mar 2024 04:07:06 -0500 (EST) Received: from smtpin13.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay10.hostedemail.com (Postfix) with ESMTP id 7D8F7C09EA for ; Mon, 4 Mar 2024 09:07:06 +0000 (UTC) X-FDA: 81858777252.13.984AF86 Received: from mail-lf1-f45.google.com (mail-lf1-f45.google.com [209.85.167.45]) by imf15.hostedemail.com (Postfix) with ESMTP id 5512CA0006 for ; Mon, 4 Mar 2024 09:07:04 +0000 (UTC) Authentication-Results: imf15.hostedemail.com; dkim=pass header.d=gmail.com header.s=20230601 header.b=Ns4RQsDF; spf=pass (imf15.hostedemail.com: domain of urezki@gmail.com designates 209.85.167.45 as permitted sender) smtp.mailfrom=urezki@gmail.com; dmarc=pass (policy=none) header.from=gmail.com ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1709543224; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references:dkim-signature; bh=4DqcilsZP9EhZLGnuBOtOr+cfSGsEERN5zh397oSQO0=; b=fAdOwUuXoYMWYOyAZBv0TGLywWDwbcvczd9H0YSiycGI43N0DFCNf1wconZhT/uaRLpmg1 stOxCjgI5a8Jkwfflwc3f943q9QjxVpIYbx4X/HeyPUdXVV0mDYX8Fz0WAv5HRVRkVgF7N QvFUPDlb8lPuBxCIVUwcSaLt8OT3noQ= ARC-Seal: i=1; s=arc-20220608; d=hostedemail.com; t=1709543224; a=rsa-sha256; cv=none; b=qZdIx3xVYjWzyX1XhhI70nL1eHVekS9CYHtrFLGC5vX64elGIMlYJbFxFuUwOXOnn8IyG5 VjSqYgDvTtzJmkAjIXQefWrzhZNvFzLLtV1n58muGSQFZ8towS+KQPrExeFLVw9JmBH9h9 Ka0tosPo6hEBlYQ+bQpNtUHMJwkb31o= ARC-Authentication-Results: i=1; imf15.hostedemail.com; dkim=pass header.d=gmail.com header.s=20230601 header.b=Ns4RQsDF; spf=pass (imf15.hostedemail.com: domain of urezki@gmail.com designates 209.85.167.45 as permitted sender) smtp.mailfrom=urezki@gmail.com; dmarc=pass (policy=none) header.from=gmail.com Received: by mail-lf1-f45.google.com with SMTP id 2adb3069b0e04-5101cd91017so5814924e87.2 for ; Mon, 04 Mar 2024 01:07:03 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20230601; t=1709543222; x=1710148022; darn=kvack.org; h=in-reply-to:content-transfer-encoding:content-disposition :mime-version:references:message-id:subject:cc:to:date:from:from:to :cc:subject:date:message-id:reply-to; bh=4DqcilsZP9EhZLGnuBOtOr+cfSGsEERN5zh397oSQO0=; b=Ns4RQsDFiVAuluoXfdo9GtrRQoHGr6I0bkji4dqN4vv05mKuQSfzlamSi49r6pVpZo ZlLIKwiHL8CenfGi+t0tmgrm7LGK6W7HqB9lwM7gpGKJSRdiUHsGWPpIst95Rj12V5A9 KegijzE5y6VnergDdTRybpTcD7EhuPDb40KQ1tBHDAEi8PSRrmPQdKDMwUY90Aicth7G IwcHnx+1l+HuURlLd6c3axpoUE77Ejbi6GAYtmQ5vpoWsnt61Eo/evK+1bz2Yh2FSTfd 1BGmSE8+rTI+dG2eQVSg+czgXe/qnOrn65cwyt4FnaekdQ0fcDVkHdbrh1U9PsiuEXkq e3Ig== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1709543222; x=1710148022; h=in-reply-to:content-transfer-encoding:content-disposition :mime-version:references:message-id:subject:cc:to:date:from :x-gm-message-state:from:to:cc:subject:date:message-id:reply-to; bh=4DqcilsZP9EhZLGnuBOtOr+cfSGsEERN5zh397oSQO0=; b=LybZ5CqIzIAvkaVhJnVuoEridzizXo0aGulBVQRsrB6/FE/dGPikRO1GnghGFLg9It uzlPJpk+uvxAD2ZRqacJglheWZ9HWesyMddwUBC+t0CUNil1Nyok70iv1f+APC4JHN+M bC4JSPzGp59+ViUHiBITfMB+7WffVV44ZMeWg9OH2MZnWQYNVZGATXX/HP3ffrjbXi7u l3PwmG0R4W/WDRWx5NsYJp7IzSaLE5JSBIpWK+ysA6qG+S3xmF1jbefDoMG8vGsAu1gx Yvtz5AMFmDlgsTLHRi/Lh75tbt7/LJYW+eCtofxBgOteRE1M0fJBWFvPhy4sXoZEuQcQ 8lig== X-Forwarded-Encrypted: i=1; AJvYcCWkzPS+F3eyZYEy4cHHXo0nSWQveMPsLtqVy0iqZOhx7r1WumDdnmrEJIqfgoJVwGF9PPi95nXfVnOwYUO1Gvg45rc= X-Gm-Message-State: AOJu0YysqRWIFjWbqSgwaotlSrYaUSq4fy181s4vJ2UhXqEBHQjZfX8c K8ioJi+KAcJrBdKbBg/vCHrMF7Z3UZjfUESVI41PgK2Q+IZKK4cm X-Google-Smtp-Source: AGHT+IHBUBv0In8u7A05VapXG8lPFOs9sg9a6V3dz1tXpQY1T0ylBgCdoYTCaOfGBWPedz92HXljJg== X-Received: by 2002:ac2:5929:0:b0:513:28c1:99fe with SMTP id v9-20020ac25929000000b0051328c199femr5047817lfi.37.1709543222111; Mon, 04 Mar 2024 01:07:02 -0800 (PST) Received: from pc636 (host-90-235-1-20.mobileonline.telia.com. [90.235.1.20]) by smtp.gmail.com with ESMTPSA id r12-20020a19ac4c000000b005134b21892csm149169lfc.223.2024.03.04.01.07.00 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Mon, 04 Mar 2024 01:07:01 -0800 (PST) From: Uladzislau Rezki X-Google-Original-From: Uladzislau Rezki Date: Mon, 4 Mar 2024 10:06:58 +0100 To: kernel test robot Cc: Uladzislau Rezki , oe-lkp@lists.linux.dev, lkp@intel.com, Linux Memory Management List , Andrew Morton , Baoquan He , Christoph Hellwig , Dave Chinner , Joel Fernandes , Kazuhito Hagio , "Liam R. Howlett" , Lorenzo Stoakes , Matthew Wilcox , Oleksiy Avramchenko , "Paul E. McKenney" , ying.huang@intel.com, feng.tang@intel.com, fengwei.yin@intel.com Subject: Re: [linux-next:master] [mm] 8f33a2ff30: stress-ng.resched.ops_per_sec -10.3% regression Message-ID: References: <202402292306.8520763a-oliver.sang@intel.com> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: <202402292306.8520763a-oliver.sang@intel.com> X-Rspamd-Queue-Id: 5512CA0006 X-Rspam-User: X-Rspamd-Server: rspam11 X-Stat-Signature: 5zqmty1mzu4rtw9gb3wbx1wrxkdgy4kj X-HE-Tag: 1709543224-428031 X-HE-Meta: U2FsdGVkX18dNsAKFORjE+S3Dm9xYJgWOAOKNrMlmka25B5x21sDOr0ASFYyogHBl8Bta9XV8NrVj2U/K+fIuavN30Vta/iA6mfd7xind6E5f9FS8yZ8i9OUs2jZWgjPyJwHOj3NxoknedWc3B1wniybXD1RWQWHqZ11l284Q9y2bKIsKyliub7sBOZ5mzykL6mFqjuyIbytqs0UOYmsz9zoD5PC3amDnHTHHcnY3FunAdaMIG0jAJ5uJ5RvHV03pAPus77eG4PigrxyJAAqeoF1eJQFgVTPcWa3NWxKHTZW0srOt/9gIjBKxMBbExHOG7pIE/7lzWoBOfZsMIBMtmQeW2GZRxvRY+zHNrGOXhDnOg1OSaymy1+NGQFoKqJ9s99KlHN4l6oG+KnHsSoDadINJztEogl04d1hrVBt4ZV27icr7R5qzXvYdPhP93XE7VM094KJGNWX6JmP19mTx3FTMKuc76+YGX+mVSlCdvOHz6D6vi+3zF3OXlmX9RQ/6rMclsYnseAbHkbdGtDu5QcPILAlstoUa8Q6XPQ//pz+HgOrdvT3U1SEx2Ha018reNzqzYhOaf/Q22r6nYeBvEWmsCnvqgWQ0icVNKoEvd3c7YuSdwLNHqOZtJSMIMQvDWXy/IOdYC6xYtV6PN+RH4P1wyx/IDir9+dkIM7obuHgON+DBYlxG/ScsZs6/RmugNWHfKTdFPQs6+MmQ58vNTn6TXvfsZ/AHObQ58jzLMAZj6lVHSFH7yvrSGLQp2joCkTl8gU0ghD/85DlXv/eA9sovurjuyUHlpgVlskqYPzmDap2Hle4NBcQBeCGeA1jUmIgG9jNgWgaDfSw97ozmbs2D0eR7j3JEKomAOT5bNrJ46Jt8TknOknxuHRUIMGfiZdFt88qj02WykPH3y8FqOxwbf3m0tmZqGVnoJ0M5oFKlBmsVqYtRqd2NCUrQSKzQ/66XQRwo1vEVwE6wmB /ZKL1/N+ Xuo4nUy6LPbb1YG9vEJVMLKXeRxlfXohyZjoZKDsbwCe7uqapjyx4SG6sifSayxF2JlqqnQtDA0Mprt6sivac5OT/UUYrlITbS98p961OSpsdsZG90AfNDKfOk9rJLBb0D4UShqA7/BQII3w5P3fCK/Q4ZqdnfB9uuKFWI+rs099jlsJdRDuau1eWliYCABgr1xY8sQYPlItfpIqJL84aprlEDNWOSyW2Op2OufuEEa2+MfjHaIFj2vnFNz6Qil0jkYLBm5xYp2lpy2ow1SKLa2Y1oPo+Zt+sJIPYDflLzAiXJbxG4VZplZgxN0uDAQI5jTSkNPwp45K8hPuzNZJB1JtcgjYjIfsBWH4ICEAaDGNK4ZSmpHRoexQ2LBFb9zmwVvTURwun8BFM/kkNgkH+sRWlnxTjg+Se/0jfpPASKbISLZHgefDilSFKH38EEG+qSBmBPeeF/PcsxLvyf6jYoSti/ulupArhmNBmZHIieVb9CP/rZ/DGBsvg3pInwBegR7Jm3WWiXoWjf88JB2DTDjghUl/gI+BK5/PNj3koMtu9Y50raOlpK2f9nudxDiKRspo0XcCH7bq/ObcBVJUKfOTreLmkl+FRUSkt X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: List-Subscribe: List-Unsubscribe: Hello. > > Hello, > > kernel test robot noticed a -10.3% regression of stress-ng.resched.ops_per_sec on: > > > commit: 8f33a2ff307248c3e55a7696f60b3658b28edb57 ("mm: vmalloc: set nr_nodes based on CPUs in a system") > https://git.kernel.org/cgit/linux/kernel/git/next/linux-next.git master > > testcase: stress-ng > test machine: 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory > parameters: > > nr_threads: 100% > testtime: 60s > test: resched > cpufreq_governor: performance > > > In addition to that, the commit also has significant impact on the following tests: > > +------------------+-------------------------------------------------------------------------------------------+ > | testcase: change | stress-ng: stress-ng.pthread.ops_per_sec 23.0% improvement | > | test machine | 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory | > | test parameters | cpufreq_governor=performance | > | | nr_threads=100% | > | | test=pthread | > | | testtime=60s | > +------------------+-------------------------------------------------------------------------------------------+ > | testcase: change | stress-ng: stress-ng.fstat.ops_per_sec 14.2% improvement | > | test machine | 64 threads 2 sockets Intel(R) Xeon(R) Gold 6346 CPU @ 3.10GHz (Ice Lake) with 256G memory | > | test parameters | cpufreq_governor=performance | > | | disk=1HDD | > | | fs=xfs | > | | nr_threads=100% | > | | test=fstat | > | | testtime=60s | > +------------------+-------------------------------------------------------------------------------------------+ > > > If you fix the issue in a separate patch/commit (i.e. not just a new version of > the same patch/commit), kindly add following tags > | Reported-by: kernel test robot > | Closes: https://lore.kernel.org/oe-lkp/202402292306.8520763a-oliver.sang@intel.com > > > Details are as below: > --------------------------------------------------------------------------------------------------> > > > The kernel config and materials to reproduce are available at: > https://download.01.org/0day-ci/archive/20240229/202402292306.8520763a-oliver.sang@intel.com > > ========================================================================================= > compiler/cpufreq_governor/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime: > gcc-12/performance/x86_64-rhel-8.3/100%/debian-12-x86_64-20240206.cgz/lkp-icl-2sp7/resched/stress-ng/60s > > commit: > 8e1d743f2c ("mm: vmalloc: support multiple nodes in vmallocinfo") > 8f33a2ff30 ("mm: vmalloc: set nr_nodes based on CPUs in a system") > 8e1d743f2c ("mm: vmalloc: support multiple nodes in vmallocinfo") this commit has nothing to do with this test. > > 8e1d743f2c2671aa 8f33a2ff307248c3e55a7696f60 > ---------------- --------------------------- > %stddev %change %stddev > \ | \ > 7.48 -0.8 6.73 mpstat.cpu.all.nice% > 10439977 -10.4% 9351864 vmstat.system.cs > 14670714 ± 3% +18.1% 17330709 ± 5% numa-numastat.node0.local_node > 14688319 ± 3% +18.1% 17348214 ± 5% numa-numastat.node0.numa_hit > 14538034 ± 3% +15.7% 16824234 ± 4% numa-numastat.node1.local_node > 14556613 ± 3% +15.6% 16834659 ± 4% numa-numastat.node1.numa_hit > 14685240 ± 3% +18.0% 17334251 ± 5% numa-vmstat.node0.numa_hit > 14667635 ± 3% +18.1% 17316745 ± 5% numa-vmstat.node0.numa_local > 14551744 ± 3% +15.6% 16815047 ± 4% numa-vmstat.node1.numa_hit > 14533165 ± 3% +15.6% 16804623 ± 4% numa-vmstat.node1.numa_local > 9.153e+08 -10.3% 8.208e+08 stress-ng.resched.ops > 15220752 -10.3% 13651349 stress-ng.resched.ops_per_sec > 6.584e+08 -10.8% 5.871e+08 stress-ng.time.involuntary_context_switches > I tested the "resched" use case on my setup to check the commit: 8f33a2ff30 ("mm: vmalloc: set nr_nodes based on CPUs in a system") n=0; while [ $n -lt 20 ]; do stress-ng --timeout 60 --times --verify --metrics --no-rand-seed --resched 64; n=$(( $n + 1 )); done 1) One socket system 32 CPUS, 64 threads, 128G of memory: (revert 8f33a2ff30) (with 8f33a2ff30) resched bogo ops/s resched bogo ops/s resched diff % 1105043856 18404843 1110469441 18491268 -0.49 1094766811 18231572 1117884383 18616359 -2.11 1103621287 18376740 1105661054 18411893 -0.18 1079532022 17973123 1101247950 18337844 -2.01 1099874899 18316050 1089695381 18144556 0.93 1076430974 17921542 1074824321 17899317 0.15 1071025136 17835263 1097552346 18276981 -2.48 1092038983 18182772 1103594553 18377955 -1.06 1099140652 18299703 1080602374 17994387 1.69 1100454122 18324364 1094512741 18227744 0.54 1092551777 18195189 1099387884 18305866 -0.63 1098877800 18297198 1095319518 18240721 0.32 1103042823 18366819 1086364199 18090137 1.51 1083722244 18046970 1073436871 17876677 0.95 1101988080 18350823 1080819704 17996891 1.92 1086171084 18087685 1080936227 17998387 0.48 1106178491 18419226 1078155643 17953565 2.53 1084124963 18053216 1087789728 18111601 -0.34 1076017418 17916972 1090240538 18153644 -1.32 1091438151 18174424 1094233215 18221998 -0.26 no difference. 2) Simulated a NUMA system same as your configuration, two nodes with 16 CPUs each, in total 64 threads: Do not post result here since no difference. -- Uladzislau Rezki