From: David Hildenbrand <david@redhat.com>
To: Florian Fainelli <f.fainelli@gmail.com>, Michal Hocko <mhocko@suse.com>
Cc: Vlastimil Babka <vbabka@suse.cz>, Mel Gorman <mgorman@suse.de>,
Minchan Kim <minchan@kernel.org>,
Johannes Weiner <hannes@cmpxchg.org>,
l.stach@pengutronix.de, LKML <linux-kernel@vger.kernel.org>,
Jaewon Kim <jaewon31.kim@samsung.com>,
Michal Nazarewicz <mina86@mina86.com>,
Joonsoo Kim <iamjoonsoo.kim@lge.com>,
Oscar Salvador <OSalvador@suse.com>,
"linux-mm@kvack.org" <linux-mm@kvack.org>
Subject: Re: alloc_contig_range() with MIGRATE_MOVABLE performance regression since 4.9
Date: Mon, 17 May 2021 09:46:12 +0200 [thread overview]
Message-ID: <d2bf87c0-7a2d-d663-a0ac-99840c77cd44@redhat.com> (raw)
In-Reply-To: <a15bd5c9-870d-3824-99cc-e5073d4d42a1@gmail.com>
On 16.05.21 18:13, Florian Fainelli wrote:
>
>
> On 4/22/2021 12:31 PM, Florian Fainelli wrote:
>>> For
>>>
>>> https://lkml.kernel.org/r/20210121175502.274391-3-minchan@kernel.org
>>>
>>> to do its work you'll have to pass __GFP_NORETRY to
>>> alloc_contig_range(). This requires CMA adaptions, from where we call
>>> alloc_contig_range().
>>
>> Yes, I did modify the alloc_contig_range() caller to pass GFP_KERNEL |
>> __GFP_NORETRY. I did run for a more iterations (1000) and the results
>> are not very conclusive as with __GFP_NORETRY the allocation time per
>> allocation was not significantly better, in fact it was slightly worse
>> by 100us than without.
>>
>> My x86 VM with 1GB of DRAM including 512MB being in ZONE_MOVABLE does
>> shows identical numbers for both 4.9 and 5.4 so this must be something
>> specific to ARM64 and/or the code we added to create a ZONE_MOVABLE on
>> that architecture since movablecore does not appear to have any effect
>> unlike x86.
>
> We tracked down the slowdowns to be caused by two major contributors:
>
> - for a reason that we do not fully understand yet the same cpufreq
> governor (conservative) did not cause alloc_contig_range() to be slowed
> down on 4.9 as much as it it with 5.4, running tests with the
> performance cpufreq governor works a tad better and the results are more
> consistent from run to run with a smaller variation.
Interesting! So your CPU is down-clocking while performing (heavy)
kernel work? Is that expected or are we mis-accounting kernel cpu time
somehow when it comes to determining the CPU target frequency?
>
> - another large contributor to the slowdown was having enabled
> CONFIG_IRQSOFF_TRACER. After c3bc8fd637a9623f5c507bd18f9677effbddf584
> ("tracing: Centralize preemptirq tracepoints and unify their usage") we
> now prepare arguments for tracing even if we end-up not using them since
> tracing is not enabled at runtime. Getting the caller function's return
> address is cheap on arm64 for level == 0, but getting the preceding
> caller involves doing a backtrace walk which is expensive (see
> arch/arm64/kernel/return_address.c).
Again, very interesting finding.
>
> So with these two variables eliminated we are only about x2 slower on
> 5.4 than we were on 4.9 and this is acceptable for our use case. I would
> not say the case is closed but at least we understand it better. We now
> have 5.10 brought up to speed so any new investigation will be focused
> on that kernel.
>
Thanks for the insight, please do let me know when you learn more. x2
slowdown still is quite a lot.
--
Thanks,
David / dhildenb
prev parent reply other threads:[~2021-05-17 7:46 UTC|newest]
Thread overview: 6+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <dbdf7b8c-9165-f87c-92d4-cfb5a4f01221@gmail.com>
[not found] ` <YIEqpIOAyrs26soC@dhcp22.suse.cz>
2021-04-22 8:56 ` David Hildenbrand
2021-04-22 17:50 ` Florian Fainelli
2021-04-22 18:35 ` David Hildenbrand
2021-04-22 19:31 ` Florian Fainelli
2021-05-16 16:13 ` Florian Fainelli
2021-05-17 7:46 ` David Hildenbrand [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=d2bf87c0-7a2d-d663-a0ac-99840c77cd44@redhat.com \
--to=david@redhat.com \
--cc=OSalvador@suse.com \
--cc=f.fainelli@gmail.com \
--cc=hannes@cmpxchg.org \
--cc=iamjoonsoo.kim@lge.com \
--cc=jaewon31.kim@samsung.com \
--cc=l.stach@pengutronix.de \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=mgorman@suse.de \
--cc=mhocko@suse.com \
--cc=mina86@mina86.com \
--cc=minchan@kernel.org \
--cc=vbabka@suse.cz \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox