From: "Huang, Ying" <ying.huang@intel.com>
To: Johannes Weiner <hannes@cmpxchg.org>
Cc: Andrew Morton <akpm@linux-foundation.org>,
Vlastimil Babka <vbabka@suse.cz>,
Mel Gorman <mgorman@techsingularity.net>,
Miaohe Lin <linmiaohe@huawei.com>,
Kefeng Wang <wangkefeng.wang@huawei.com>,
Zi Yan <ziy@nvidia.com>,
linux-mm@kvack.org, linux-kernel@vger.kernel.org
Subject: Re: [PATCH 1/6] mm: page_alloc: remove pcppage migratetype caching
Date: Sat, 30 Sep 2023 12:26:01 +0800 [thread overview]
Message-ID: <87pm20p9ra.fsf@yhuang6-desk2.ccr.corp.intel.com> (raw)
In-Reply-To: <20230927145115.GA365513@cmpxchg.org> (Johannes Weiner's message of "Wed, 27 Sep 2023 10:51:15 -0400")
Johannes Weiner <hannes@cmpxchg.org> writes:
> On Wed, Sep 27, 2023 at 01:42:25PM +0800, Huang, Ying wrote:
>> Johannes Weiner <hannes@cmpxchg.org> writes:
>>
>> > The idea behind the cache is to save get_pageblock_migratetype()
>> > lookups during bulk freeing. A microbenchmark suggests this isn't
>> > helping, though. The pcp migratetype can get stale, which means that
>> > bulk freeing has an extra branch to check if the pageblock was
>> > isolated while on the pcp.
>> >
>> > While the variance overlaps, the cache write and the branch seem to
>> > make this a net negative. The following test allocates and frees
>> > batches of 10,000 pages (~3x the pcp high marks to trigger flushing):
>> >
>> > Before:
>> > 8,668.48 msec task-clock # 99.735 CPUs utilized ( +- 2.90% )
>> > 19 context-switches # 4.341 /sec ( +- 3.24% )
>> > 0 cpu-migrations # 0.000 /sec
>> > 17,440 page-faults # 3.984 K/sec ( +- 2.90% )
>> > 41,758,692,473 cycles # 9.541 GHz ( +- 2.90% )
>> > 126,201,294,231 instructions # 5.98 insn per cycle ( +- 2.90% )
>> > 25,348,098,335 branches # 5.791 G/sec ( +- 2.90% )
>> > 33,436,921 branch-misses # 0.26% of all branches ( +- 2.90% )
>> >
>> > 0.0869148 +- 0.0000302 seconds time elapsed ( +- 0.03% )
>> >
>> > After:
>> > 8,444.81 msec task-clock # 99.726 CPUs utilized ( +- 2.90% )
>> > 22 context-switches # 5.160 /sec ( +- 3.23% )
>> > 0 cpu-migrations # 0.000 /sec
>> > 17,443 page-faults # 4.091 K/sec ( +- 2.90% )
>> > 40,616,738,355 cycles # 9.527 GHz ( +- 2.90% )
>> > 126,383,351,792 instructions # 6.16 insn per cycle ( +- 2.90% )
>> > 25,224,985,153 branches # 5.917 G/sec ( +- 2.90% )
>> > 32,236,793 branch-misses # 0.25% of all branches ( +- 2.90% )
>> >
>> > 0.0846799 +- 0.0000412 seconds time elapsed ( +- 0.05% )
>> >
>> > A side effect is that this also ensures that pages whose pageblock
>> > gets stolen while on the pcplist end up on the right freelist and we
>> > don't perform potentially type-incompatible buddy merges (or skip
>> > merges when we shouldn't), which is likely beneficial to long-term
>> > fragmentation management, although the effects would be harder to
>> > measure. Settle for simpler and faster code as justification here.
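
To make sure I read the change correctly, my mental model of the
before/after free paths is the toy user-space sketch below.  It only
illustrates the concept, not the kernel code: toy_lookup_migratetype()
stands in for get_pageblock_migratetype(), and the other toy_* names
and the zone_has_isolated flag are made up for illustration.

/*
 * Toy user-space model of the before/after pcp flush paths.
 * Not kernel code; all names here are invented.
 */
#include <stdio.h>
#include <stdbool.h>

enum toy_mt { TOY_MT_MOVABLE, TOY_MT_ISOLATE };

struct toy_page {
	unsigned long pfn;
	enum toy_mt cached_mt;	/* "pcppage migratetype" remembered at free time */
};

static enum toy_mt toy_lookup_migratetype(unsigned long pfn)
{
	/* pretend pageblock 3 was isolated while its pages sat on the pcp */
	return (pfn >> 9) == 3 ? TOY_MT_ISOLATE : TOY_MT_MOVABLE;
}

/* Before: trust the cached type, plus the extra branch for isolation. */
static enum toy_mt flush_cached(const struct toy_page *p, bool zone_has_isolated)
{
	if (zone_has_isolated)		/* cached type may be stale */
		return toy_lookup_migratetype(p->pfn);
	return p->cached_mt;
}

/* After: no cache write at free time, one authoritative lookup at flush. */
static enum toy_mt flush_uncached(const struct toy_page *p)
{
	return toy_lookup_migratetype(p->pfn);
}

int main(void)
{
	struct toy_page p = { .pfn = 3UL << 9, .cached_mt = TOY_MT_MOVABLE };

	printf("cached path:   %d\n", flush_cached(&p, true));
	printf("uncached path: %d\n", flush_uncached(&p));
	return 0;
}
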
>>
>> I suspected that the PCP allocation/freeing path may be influenced
>> (that is, the allocation/freeing batch is smaller than the PCP high
>> mark).  So I tested single-process will-it-scale/page_fault1 with
>> sysctl percpu_pagelist_high_fraction=8, so that pages are allocated
>> and freed from/to the PCP only.  The test results are as follows:
>>
>> Before:
>> will-it-scale.1.processes 618364.3 (+- 0.075%)
>> perf-profile.children.get_pfnblock_flags_mask 0.13 (+- 9.350%)
>>
>> After:
>> will-it-scale.1.processes 616512.0 (+- 0.057%)
>> perf-profile.children.get_pfnblock_flags_mask 0.41 (+- 22.44%)
>>
>> The change isn't large: -0.3%. Perf profiling shows the cycles% of
>> get_pfnblock_flags_mask() increases.
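
BTW., my understanding of why the extra lookup is visible at all: each
call has to compute the pageblock index from the pfn and pull that
block's bits out of the packed pageblock bitmap, roughly as in the toy
model below (illustrative constants and names only, not the kernel's
implementation).

#include <stdio.h>

#define TOY_PAGEBLOCK_ORDER	9	/* 512 pages per toy pageblock */
#define TOY_BITS_PER_BLOCK	4	/* a few flag bits per pageblock */

static unsigned long toy_bitmap[1024];	/* packed pageblock bits */

static unsigned int toy_get_block_bits(unsigned long pfn)
{
	unsigned long block  = pfn >> TOY_PAGEBLOCK_ORDER;
	unsigned long bitidx = block * TOY_BITS_PER_BLOCK;
	unsigned long word   = toy_bitmap[bitidx / (8 * sizeof(unsigned long))];

	return (word >> (bitidx % (8 * sizeof(unsigned long)))) &
	       ((1U << TOY_BITS_PER_BLOCK) - 1);
}

int main(void)
{
	toy_bitmap[0] = 0x2;	/* pretend pageblock 0 has type bits 0b0010 */
	printf("pfn 100 -> pageblock bits %u\n", toy_get_block_bits(100));
	return 0;
}
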
>
> Ah, this is going through the free_unref_page_list() path that
> Vlastimil had pointed out as well. I made another change on top that
> eliminates the second lookup. After that, both pcp fast paths have the
> same number of lookups as before: 1. This fixes the regression for me.
>
> Would you mind confirming this as well?
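Sure.  Just to double-check my understanding of the add-on change: the
point is to do the pageblock lookup once per page and reuse the result
for both the isolation check and the pcp bucket choice, roughly like
the toy sketch below (user-space illustration only, not the real
free_unref_page_list(); all toy_* names are made up).

#include <stdio.h>

enum toy_mt { TOY_MT_MOVABLE, TOY_MT_ISOLATE };

struct toy_page { unsigned long pfn; };

static enum toy_mt toy_lookup_migratetype(unsigned long pfn)
{
	/* pretend pageblock 3 is isolated */
	return (pfn >> 9) == 3 ? TOY_MT_ISOLATE : TOY_MT_MOVABLE;
}

static void toy_free_directly(struct toy_page *p, enum toy_mt mt)
{
	printf("pfn %lu: freed to buddy as type %d\n", p->pfn, mt);
}

static void toy_add_to_pcp(struct toy_page *p, enum toy_mt mt)
{
	printf("pfn %lu: queued on pcp list %d\n", p->pfn, mt);
}

static void toy_free_list(struct toy_page *pages, int nr)
{
	for (int i = 0; i < nr; i++) {
		/* one lookup per page ... */
		enum toy_mt mt = toy_lookup_migratetype(pages[i].pfn);

		/* ... reused for both the isolation check and the pcp bucket */
		if (mt == TOY_MT_ISOLATE)
			toy_free_directly(&pages[i], mt);
		else
			toy_add_to_pcp(&pages[i], mt);
	}
}

int main(void)
{
	struct toy_page pages[] = { { .pfn = 100 }, { .pfn = 3UL << 9 } };

	toy_free_list(pages, 2);
	return 0;
}
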
I have done more tests for the series and the add-on patches.  The test
results are as follows:

base
  perf-profile.children.get_pfnblock_flags_mask   0.15     (+- 32.62%)
  will-it-scale.1.processes                       618621.7 (+- 0.18%)

mm: page_alloc: remove pcppage migratetype caching
  perf-profile.children.get_pfnblock_flags_mask   0.40     (+- 21.55%)
  will-it-scale.1.processes                       616350.3 (+- 0.27%)

mm: page_alloc: fix up block types when merging compatible blocks
  perf-profile.children.get_pfnblock_flags_mask   0.36     (+- 8.36%)
  will-it-scale.1.processes                       617121.0 (+- 0.17%)

mm: page_alloc: move free pages when converting block during isolation
  perf-profile.children.get_pfnblock_flags_mask   0.36     (+- 15.10%)
  will-it-scale.1.processes                       615578.0 (+- 0.18%)

mm: page_alloc: fix move_freepages_block() range error
  perf-profile.children.get_pfnblock_flags_mask   0.36     (+- 12.78%)
  will-it-scale.1.processes                       615364.7 (+- 0.27%)

mm: page_alloc: fix freelist movement during block conversion
  perf-profile.children.get_pfnblock_flags_mask   0.36     (+- 10.52%)
  will-it-scale.1.processes                       617834.8 (+- 0.52%)

mm: page_alloc: consolidate free page accounting
  perf-profile.children.get_pfnblock_flags_mask   0.39     (+- 8.27%)
  will-it-scale.1.processes                       621000.0 (+- 0.13%)

mm: page_alloc: close migratetype race between freeing and stealing
  perf-profile.children.get_pfnblock_flags_mask   0.37     (+- 5.87%)
  will-it-scale.1.processes                       618378.8 (+- 0.17%)

mm: page_alloc: optimize free_unref_page_list()
  perf-profile.children.get_pfnblock_flags_mask   0.20     (+- 14.96%)
  will-it-scale.1.processes                       618136.3 (+- 0.16%)

It seems that the will-it-scale score is influenced by some other
factors too.  In any case, the series together with the add-on patches
restores the will-it-scale score, and the cycles% of
get_pfnblock_flags_mask() is almost fully restored by the final patch
(mm: page_alloc: optimize free_unref_page_list()).
Feel free to add my "Tested-by" for these patches.

--
Best Regards,
Huang, Ying