From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by smtp.lore.kernel.org (Postfix) with ESMTP id AF87EEB64DA for ; Tue, 11 Jul 2023 00:39:19 +0000 (UTC) Received: by kanga.kvack.org (Postfix) id A56928E0001; Mon, 10 Jul 2023 20:39:18 -0400 (EDT) Received: by kanga.kvack.org (Postfix, from userid 40) id 9EA198D0001; Mon, 10 Jul 2023 20:39:18 -0400 (EDT) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id 8A6468E0001; Mon, 10 Jul 2023 20:39:18 -0400 (EDT) X-Delivered-To: linux-mm@kvack.org Received: from relay.hostedemail.com (smtprelay0010.hostedemail.com [216.40.44.10]) by kanga.kvack.org (Postfix) with ESMTP id 7350C8D0001 for ; Mon, 10 Jul 2023 20:39:18 -0400 (EDT) Received: from smtpin06.hostedemail.com (a10.router.float.18 [10.200.18.1]) by unirelay09.hostedemail.com (Postfix) with ESMTP id 3C178802A5 for ; Tue, 11 Jul 2023 00:39:18 +0000 (UTC) X-FDA: 80997471996.06.A2F82C7 Received: from mga03.intel.com (mga03.intel.com [134.134.136.65]) by imf01.hostedemail.com (Postfix) with ESMTP id 7F5CD40009 for ; Tue, 11 Jul 2023 00:39:15 +0000 (UTC) Authentication-Results: imf01.hostedemail.com; dkim=pass header.d=intel.com header.s=Intel header.b=B9vlPoNf; dmarc=pass (policy=none) header.from=intel.com; spf=pass (imf01.hostedemail.com: domain of ying.huang@intel.com designates 134.134.136.65 as permitted sender) smtp.mailfrom=ying.huang@intel.com ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=hostedemail.com; s=arc-20220608; t=1689035956; h=from:from:sender:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type:content-transfer-encoding: in-reply-to:in-reply-to:references:references:dkim-signature; bh=Mp9mpsN+rGLqxk8y9dTDjNn6hZRqQM7WXtGqp49fOF0=; b=bTtfXgtTPD7LRlsbsn/55gMyn1Oq3lYqaOAQWWUYiCQ3Hm0XmokytN6IEsTmQNIiY5tvp1 hLKh7ijmDAsNkUTJXPaHhXfUy9uzgiZu1gqgc/NZOTx6fIj71shWbnh3qyCPvcXwhCih1W Qy3lMDXR1u8xfu2RRc+NxsGOVl36uGg= ARC-Authentication-Results: i=1; imf01.hostedemail.com; dkim=pass header.d=intel.com header.s=Intel header.b=B9vlPoNf; dmarc=pass (policy=none) header.from=intel.com; spf=pass (imf01.hostedemail.com: domain of ying.huang@intel.com designates 134.134.136.65 as permitted sender) smtp.mailfrom=ying.huang@intel.com ARC-Seal: i=1; s=arc-20220608; d=hostedemail.com; t=1689035956; a=rsa-sha256; cv=none; b=R/Frz9IZFnlEhxyn1zRq5gf2MMF46mLlI4JbGjOGtCV59aQOmtmdEa+p032GPCsFalx5r2 sxYi43q+JnHg8lQwMHzAdqzlLtbsn28urcnFUSdcolQs8zNjx8+9UAQtSdUXL8lyHKBuH+ MFpF1bie+1CSKwPgzzjaTwc+XNCgCg8= DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1689035955; x=1720571955; h=from:to:cc:subject:references:date:in-reply-to: message-id:mime-version; bh=iNxYXPFDNTfrRIvXLSOf/pg6gRvftGW6JFV+5PL445k=; b=B9vlPoNfmuUEjkzx74M3nUt5WupXKPaGu8mpcqQenBMXZq43/u89pF5C 8rFwHST8I4XxGcLykmVvuupNnZAv9W3oTYkg70CN3qJTeKuHQH3p1Vxeu 9/h2l7odiek/3aDSq8o3r83da3+9Z9ougdbQLslw/mgXw06Bxt0ZKMxXQ uMIcmbk8fTICWuKAU0cmi91fqV4gj3uQoGkqm5guD51EjKY+UlFymMphZ 4tMqMZJAOa3L7jpJNG3ao6kJ52IalKsl3l0Y45ZTTiUWFkiG7YBdB8lcv wfCETvpvkM1GymZtZ/55HDOV1Y8tztfu8THbpRL/IJhlwLFTtoiXrD4RV w==; X-IronPort-AV: E=McAfee;i="6600,9927,10767"; a="367989481" X-IronPort-AV: E=Sophos;i="6.01,195,1684825200"; d="scan'208";a="367989481" Received: from orsmga005.jf.intel.com ([10.7.209.41]) by orsmga103.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 10 Jul 2023 17:39:00 -0700 X-ExtLoop1: 1 X-IronPort-AV: E=McAfee;i="6600,9927,10767"; a="894979488" X-IronPort-AV: E=Sophos;i="6.01,195,1684825200"; d="scan'208";a="894979488" Received: from yhuang6-desk2.sh.intel.com (HELO yhuang6-desk2.ccr.corp.intel.com) ([10.238.208.55]) by orsmga005-auth.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 10 Jul 2023 17:38:58 -0700 From: "Huang, Ying" To: Baolin Wang Cc: , , , , , Subject: Re: [PATCH 2/2] mm: compaction: skip the memory hole rapidly when isolating free pages References: <87pm50fg88.fsf@yhuang6-desk2.ccr.corp.intel.com> <36747b09-8141-019f-85dd-59f3d4623ec1@linux.alibaba.com> Date: Tue, 11 Jul 2023 08:37:10 +0800 In-Reply-To: <36747b09-8141-019f-85dd-59f3d4623ec1@linux.alibaba.com> (Baolin Wang's message of "Mon, 10 Jul 2023 17:26:51 +0800") Message-ID: <87zg43e11l.fsf@yhuang6-desk2.ccr.corp.intel.com> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/28.2 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=ascii X-Rspamd-Server: rspam09 X-Rspamd-Queue-Id: 7F5CD40009 X-Stat-Signature: z15feguxey7ap4bx4cuzqrtphwyfecbq X-Rspam-User: X-HE-Tag: 1689035955-925100 X-HE-Meta: U2FsdGVkX19xNTewDZ92KxttFW7nHUwmy59DGWNHDC91EPAQ5S9yfOv1XU8BrLVYVmpnajbcVSaxOhuK4Ac3ert0gIZ/3iQsv4Hh1nenkOIVVBZ8TWPPfCAtLzgE11/O5JPTc90Q5M5TcxopM1kLmKv2YVVPQ8WTGpxzzJdR7eWT13EKKYq0/eXEB+D035KJ3h9eOS6izBk7Bpe+kUMCzHuazFoE1eYqX5oHpLe/d1mv8HH5wm+JTlQk80E2GRwUhlnxFaNoZPLXRCHReaVWk4C5ksDoe7JRAfSEDV0GNjx4YNt1HWEApicACLP4BqBzPHjsWJM2pCwk1Jv0hT3phi5BHYifK6pPbukFfQfVL4Sh+jDRhIc16HwXUpyWIpScBUWWTk9LXAEnsINhhvVIgrrkbGjFw7PZVV8yIQWR8glkVeVCip/eFLNIU28qvQbx3hjEbA9gBYACRU7mWGlkJO7EwADN17iAFh4U+fef+dWPy/0a6G+71f+cA4DNUxNjc5kX+HIbHVU/6pP1G8sgwSF+MD205rpG0INVl90NKTmuIKH5GDmFISEwuBpZziLAdXmj6f5sRunLhstFL+fbHAjOL8QNVY0wrbp+5HZnoW9JMaFtl3s3wY5Kpr4vKbfttll2Inud8SzGORaEUnSPvehD6CIvSFczjIPK5qazfX83xdej/JwUFdJ4teMLQyqpFRWcPJHXHddTuUre3A3XTfdzTTYdWQt/No03abxsK7HRh5xzKdpLfkT75Ks9xGKckISM5rUJgRrAJCz2Bx7wwsB8jVCEAFNCn7CmOZhcc/PEVMhG82Bwcl3nBkZ37gwITDVV4wcxtKhAKZYzQWcz72PAZ3WwZcbIlaU40k6OPbuAndfF1H4OT70qH5CeuHUU8OtgxNlibpXXvCShq7kdVs4bl5sgS2GK6d/Oe1rQJbYQdNxIiA76jKy0iqNsNRVjEKwVsYTi8K/T5FUp8vj tG4jupQ9 Jaf/SbInmpSXVMA29X8joNWcVB1ag0tlXlZmvgnatjOVVaAQBQUbzYi/gC82JyItLRoOrL6ZNCbCjWZ8DRAYGIskXZuvgiSznClCE2FfUK/VhmGi7ovxcYKcPdN4zKQvOrKszZaV2duQydRknVWxuz7F2ub+SMMBtepIFcudLKhkKfbJJeTtWjvbjRxpruSJAbpH5 X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: Baolin Wang writes: > On 7/10/2023 2:11 PM, Huang, Ying wrote: >> Baolin Wang writes: >> >>> On my machine with below memory layout, and I can see it will take more >>> time to skip the larger memory hole (range: 0x100000000 - 0x1800000000) >>> when isolating free pages. So adding a new helper to skip the memory >>> hole rapidly, which can reduce the time consumed from about 70us to less >>> than 1us. >>> >>> [ 0.000000] Zone ranges: >>> [ 0.000000] DMA [mem 0x0000000040000000-0x00000000ffffffff] >>> [ 0.000000] DMA32 empty >>> [ 0.000000] Normal [mem 0x0000000100000000-0x0000001fa7ffffff] >> The memory hole is at the beginning of zone NORMAL? If so, should >> zone > > No, the memory hole range is 0x1000000000 - 0x1800000000, and the > normal zone is start from 0x100000000. > > I'm sorry I made a typo in the commit message, which confuses you. The > memory hole range should be: 0x1000000000 - 0x1800000000. I updated > the commit message to the following and addressed David's comment: Got it! Thanks for explanation! > " > Just like commit 9721fd82351d ("mm: compaction: skip memory hole rapidly > when isolating migratable pages"), I can see it will also take more > time to skip the larger memory hole (range: 0x1000000000 - 0x1800000000) > when isolating free pages on my machine with below memory layout. So > like commit 9721fd82351d, adding a new helper to skip the memory hole > rapidly, which can reduce the time consumed from about 70us to less > than 1us. LGTM. Reviewed-by: "Huang, Ying" > [ 0.000000] Zone ranges: > [ 0.000000] DMA [mem 0x0000000040000000-0x00000000ffffffff] > [ 0.000000] DMA32 empty > [ 0.000000] Normal [mem 0x0000000100000000-0x0000001fa7ffffff] > [ 0.000000] Movable zone start for each node > [ 0.000000] Early memory node ranges > [ 0.000000] node 0: [mem 0x0000000040000000-0x0000000fffffffff] > [ 0.000000] node 0: [mem 0x0000001800000000-0x0000001fa3c7ffff] > [ 0.000000] node 0: [mem 0x0000001fa3c80000-0x0000001fa3ffffff] > [ 0.000000] node 0: [mem 0x0000001fa4000000-0x0000001fa402ffff] > [ 0.000000] node 0: [mem 0x0000001fa4030000-0x0000001fa40effff] > [ 0.000000] node 0: [mem 0x0000001fa40f0000-0x0000001fa73cffff] > [ 0.000000] node 0: [mem 0x0000001fa73d0000-0x0000001fa745ffff] > [ 0.000000] node 0: [mem 0x0000001fa7460000-0x0000001fa746ffff] > [ 0.000000] node 0: [mem 0x0000001fa7470000-0x0000001fa758ffff] > [ 0.000000] node 0: [mem 0x0000001fa7590000-0x0000001fa7ffffff] > " > >> NORMAL start at 0x1800000000? And, the free pages will not be scanned >> there? Or my understanding were wrong. > >>> [ 0.000000] Movable zone start for each node >>> [ 0.000000] Early memory node ranges >>> [ 0.000000] node 0: [mem 0x0000000040000000-0x0000000fffffffff] >>> [ 0.000000] node 0: [mem 0x0000001800000000-0x0000001fa3c7ffff] >>> [ 0.000000] node 0: [mem 0x0000001fa3c80000-0x0000001fa3ffffff] >>> [ 0.000000] node 0: [mem 0x0000001fa4000000-0x0000001fa402ffff] >>> [ 0.000000] node 0: [mem 0x0000001fa4030000-0x0000001fa40effff] >>> [ 0.000000] node 0: [mem 0x0000001fa40f0000-0x0000001fa73cffff] >>> [ 0.000000] node 0: [mem 0x0000001fa73d0000-0x0000001fa745ffff] >>> [ 0.000000] node 0: [mem 0x0000001fa7460000-0x0000001fa746ffff] >>> [ 0.000000] node 0: [mem 0x0000001fa7470000-0x0000001fa758ffff] >>> [ 0.000000] node 0: [mem 0x0000001fa7590000-0x0000001fa7ffffff] >>> >>> Signed-off-by: Baolin Wang >>> --- >>> mm/compaction.c | 30 +++++++++++++++++++++++++++++- >>> 1 file changed, 29 insertions(+), 1 deletion(-) >>> >>> diff --git a/mm/compaction.c b/mm/compaction.c >>> index 43358efdbdc2..9641e2131901 100644 >>> --- a/mm/compaction.c >>> +++ b/mm/compaction.c >>> @@ -249,11 +249,31 @@ static unsigned long skip_offline_sections(unsigned long start_pfn) >>> return 0; >>> } >>> + >>> +static unsigned long skip_offline_sections_reverse(unsigned long start_pfn) >>> +{ >>> + unsigned long start_nr = pfn_to_section_nr(start_pfn); >>> + >>> + if (!start_nr || online_section_nr(start_nr)) >>> + return 0; >>> + >>> + while (start_nr-- > 0) { >>> + if (online_section_nr(start_nr)) >>> + return section_nr_to_pfn(start_nr) + PAGES_PER_SECTION - 1; >>> + } >>> + >>> + return 0; >>> +} >>> #else >>> static unsigned long skip_offline_sections(unsigned long start_pfn) >>> { >>> return 0; >>> } >>> + >>> +static unsigned long skip_offline_sections_reverse(unsigned long start_pfn) >>> +{ >>> + return 0; >>> +} >>> #endif >>> /* >>> @@ -1668,8 +1688,16 @@ static void isolate_freepages(struct compact_control *cc) >>> page = pageblock_pfn_to_page(block_start_pfn, >>> block_end_pfn, >>> zone); >>> - if (!page) >>> + if (!page) { >>> + unsigned long next_pfn; >>> + >>> + next_pfn = skip_offline_sections_reverse(block_start_pfn); >>> + if (next_pfn) >>> + block_start_pfn = max(pageblock_start_pfn(next_pfn), >>> + low_pfn); >>> + >>> continue; >>> + } >>> /* Check the block is suitable for migration */ >>> if (!suitable_migration_target(cc, page))