From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id A7B0DC48BF8 for ; Thu, 22 Feb 2024 12:07:50 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id 44F3E6B009B; Thu, 22 Feb 2024 07:07:50 -0500 (EST) Received: by kanga.kvack.org (Postfix, from userid 40) id 4001D6B009C; Thu, 22 Feb 2024 07:07:50 -0500 (EST) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id 2C7896B009D; Thu, 22 Feb 2024 07:07:50 -0500 (EST) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0013.hostedemail.com [216.40.44.13]) by kanga.kvack.org (Postfix) with ESMTP id 1D27C6B009B for ; Thu, 22 Feb 2024 07:07:50 -0500 (EST) Received: from smtpin13.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay01.hostedemail.com (Postfix) with ESMTP id EAF911C0E4A for ; Thu, 22 Feb 2024 12:07:49 +0000 (UTC) X-FDA: 81819315858.13.02957CD Received: from mgamail.intel.com (mgamail.intel.com [198.175.65.16]) by imf11.hostedemail.com (Postfix) with ESMTP id E2EDC4001B for ; Thu, 22 Feb 2024 12:07:46 +0000 (UTC) Authentication-Results: imf11.hostedemail.com; dkim=pass header.d=intel.com header.s=Intel header.b=BkLAhb09; dmarc=pass (policy=none) header.from=intel.com; spf=pass (imf11.hostedemail.com: domain of rulin.huang@intel.com designates 198.175.65.16 as permitted sender) smtp.mailfrom=rulin.huang@intel.com ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1708603667; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references:dkim-signature; bh=aJelF+A7FbrcwEIKz4Dcu4qyo1ggiaqhQz9rwK7WuvA=; b=07RROZnZF1MlN9LUWJart+EXRpoL1AHDPfjVAnoqxPMZtfdZwS09YXB+uLC4Dqi6GfH7lJ FBqliL/TClSWmjem0lMAzkBoBmVYd2qaG1oR6ou0lgoCjsVf1+p18na8XhvK1gdlqfNCO6 2W5mSX4lB1FVRkAIlmPvEva4eDGEMCA= ARC-Authentication-Results: i=1; imf11.hostedemail.com; dkim=pass header.d=intel.com header.s=Intel header.b=BkLAhb09; dmarc=pass (policy=none) header.from=intel.com; spf=pass (imf11.hostedemail.com: domain of rulin.huang@intel.com designates 198.175.65.16 as permitted sender) smtp.mailfrom=rulin.huang@intel.com ARC-Seal: i=1; s=arc-20220608; d=hostedemail.com; t=1708603667; a=rsa-sha256; cv=none; b=SpTK2vs/JfxzZLG7KXWPcBEfhnBHHPbGJ166ZWkciusfy6MkEynKjDiuSChpxpCebxjPaX NmbUz7imjiFkgRvRhAFo2b2yUTBL4iHcgaEGFoTCeQg+RzCT3nya7vv/xwzovjTXhz+ekX SLwYG+t2ATGKycBp2EyYPTDB5BMZ/sw= DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1708603667; x=1740139667; h=from:to:cc:subject:date:message-id:in-reply-to: references:mime-version:content-transfer-encoding; bh=OyVyEzm6PAoZwqACVETHMIR8OveG5ztRrupRqvUuqXc=; b=BkLAhb09MINfSm3F+JseqGCrGkN0zgHOzxtLNNobwmu5Y+EjYnvYPgqe ZwlvgQfqXDMmU49XY91G6AYjpNgCJWkdxB2+WunRT92Yl6cPwMnuO0PQT jEYyfwG6R5/+aPCjnuvAyUyFhDODlf8aIsd5pQhM/ef2Z5NuVb2zHmYMX Me9MwaH5CMCajzLxo6EcJkVh3J2cw4Ig7acAB/p4aCF7xiuWiHeDod7Qh 8jzZzZzTg6yNDHMj97GFY8uC7NK7bQxAhqWMuQq3axKf9M4hI+/uZTbvs K8P57o9m1EU17cvyB9p6tjQHHeRZtFu331B20WikpFQu2407PyCJo1wWx g==; X-IronPort-AV: E=McAfee;i="6600,9927,10991"; a="2974023" X-IronPort-AV: E=Sophos;i="6.06,177,1705392000"; d="scan'208";a="2974023" Received: from orviesa006.jf.intel.com ([10.64.159.146]) by orvoesa108.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 22 Feb 2024 04:07:45 -0800 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="6.06,177,1705392000"; d="scan'208";a="5843178" Received: from linux-pnp-server-09.sh.intel.com ([10.239.176.190]) by orviesa006.jf.intel.com with ESMTP; 22 Feb 2024 04:07:42 -0800 From: rulinhuang To: akpm@linux-foundation.org, urezki@gmail.com Cc: colin.king@intel.com, hch@infradead.org, linux-kernel@vger.kernel.org, linux-mm@kvack.org, lstoakes@gmail.com, rulin.huang@intel.com, tianyou.li@intel.com, tim.c.chen@intel.com, wangyang.guo@intel.com, zhiguo.zhou@intel.com Subject: Re: [PATCH v3] mm/vmalloc: lock contention optimization under multi-threading Date: Thu, 22 Feb 2024 07:10:45 -0500 Message-ID: <20240222121045.216556-1-rulin.huang@intel.com> X-Mailer: git-send-email 2.43.0 In-Reply-To: References: MIME-Version: 1.0 Content-Transfer-Encoding: 8bit X-Rspam-User: X-Rspamd-Server: rspam12 X-Rspamd-Queue-Id: E2EDC4001B X-Stat-Signature: mpur54rzaz3f845fdmioroemgjt1yrsf X-HE-Tag: 1708603666-38647 X-HE-Meta: U2FsdGVkX18m+/n0ZlSGQkV4RgcZteNc7D/ZPghXaxcdoka5eV69SlyITrk9FoJkL3AC558AErLWOFU9S4FL/yqwD+vtyLFvi4Mtp397Du4b6vQDGoMoOpOLO+U88vsdXYNU9gyN9hV05LYhN5pHtZQGWaIBO6cZlH25CuebOcihylGVAVZAlZZhszodTI2xE1V1nE49xs0CCIBYCVSlXJypOVBRxIb5REtrZ1UwI6OPOa/qaxx0BUNF0ALZCTJc7WUM56WVNgQPTBnSlQ7ddpi6dNjeoX8DUFr8dsdL+oyGAdD3lY1tWxN2I/1klhGLY00sikGCUFVu8FA1i4LG2XjGsr/Uwfc9xKRQr2Cs/hpLzCr631JHTNOt6NjNKdpC47b2ddCiVzl/g6sLvITzi5Qk2fEWJ/AajJ3dn5yeev2t8rHZyA/AcyRTUp5UfJQpJa642zcGVc1Xq7EEt2koxWg04KDemMrq2noNvY18C/Ixszxh8XEjgqhqDpIAkGuNaIrBf/XTieGnAG2+QhbJ/D92nsBb6ruKiKUjt6WBQ2MOXWXur97w3rVe1dZ6CTGnXehIgMElMxa8J9yu5sIqb4nxyxpYkABdYC5D806v1f1BmF5VClE/uUIC7k5RPvTiSMR4Z9mCyHd8R/Hx47Y1Xjs9XkGhnISaj0ydkl+K9TW1HeJXXtGim0ysSmPtx+27CiNCHXFb2riKYWSp3PAoT5QFbnbPokl5WIZsO2TU252v8lcYTC9CJ3iHEPPaggIElSv/Xx1bjxMEMOER3WPN79u5P18KvMSOhfsu5Jh1SkYy38kZFgCd7r2YGTf9tPMVEqCmjKefMBsL4fw7Lmxhzf/tyoazL81gSBGaR0iDbupE+k+e/cye06qIZIlaCGkL14ncrixQwyPSnteHV1rQ/HVPKoMupRCx4i8sBgQV/iV/fvcl3MBQUOhFKUOZnRQmLMEJuHlfglml6TY6DJx EiSY7k3S ANzclUBUlpHT6wvLnmjP5bBirxj0bbFsO0wxjypeu9XY+OlG3clgNmnoTw4RvoEPFDSUB/276dYOCKcRLc68ROZygG6myDXxbneoIg8aE81fb4UYYwhEEX7X5yLTHWDLJl+e4Tn2tDWC6F28pUtKFnBQnbw== X-Bogosity: Ham, tests=bogofilter, spamicity=0.000006, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: List-Subscribe: List-Unsubscribe: Hi Uladzislau and Andrew, we have rebased it(Patch v4) on branch mm-unstable and remeasured it. Could you kindly help confirm if this is the right base to work on? Compared to the previous result at kernel v6.7 with a 5% performance gain on intel icelake(160 vcpu), we only had a 0.6% with this commit base. But we think our modification still has some significance. On the one hand, this does reduce a critical section. On the other hand, we have a 4% performance gain on intel sapphire rapids(224 vcpu), which suggests more performance improvement would likely be achieved when the core count of processors increases to hundreds or even thousands. Thank you again for your comments.