From: Ryan Roberts <ryan.roberts@arm.com>
To: Yu Zhao <yuzhao@google.com>
Cc: Andrew Morton <akpm@linux-foundation.org>,
Matthew Wilcox <willy@infradead.org>,
Yin Fengwei <fengwei.yin@intel.com>,
David Hildenbrand <david@redhat.com>,
Catalin Marinas <catalin.marinas@arm.com>,
Will Deacon <will@kernel.org>,
Anshuman Khandual <anshuman.khandual@arm.com>,
Yang Shi <shy828301@gmail.com>,
"Huang, Ying" <ying.huang@intel.com>, Zi Yan <ziy@nvidia.com>,
Luis Chamberlain <mcgrof@kernel.org>,
Itaru Kitayama <itaru.kitayama@gmail.com>,
linux-mm@kvack.org, linux-kernel@vger.kernel.org,
linux-arm-kernel@lists.infradead.org
Subject: Re: [PATCH v4 2/5] mm: LARGE_ANON_FOLIO for improved performance
Date: Wed, 2 Aug 2023 10:04:56 +0100 [thread overview]
Message-ID: <951a8d96-ecdf-7ca4-ec7a-e1c5eba8bce3@arm.com> (raw)
In-Reply-To: <d628df5d-9ff6-f830-ba72-a2b7df7e5d51@arm.com>
On 02/08/2023 09:02, Ryan Roberts wrote:
...
>>>
>>> I've captured run time and peak memory usage, and taken the mean. The stdev for
>>> the peak memory usage is big-ish, but I'm confident this still captures the
>>> central tendancy well:
>>>
>>> | MAX_ORDER_UNHINTED | real-time | kern-time | user-time | peak memory |
>>> |:-------------------|------------:|------------:|------------:|:------------|
>>> | 4k | 0.0% | 0.0% | 0.0% | 0.0% |
>>> | 16k | -3.6% | -26.5% | -0.5% | -0.1% |
>>> | 32k | -4.8% | -37.4% | -0.6% | -0.1% |
>>> | 64k | -5.7% | -42.0% | -0.6% | -1.1% |
>>> | 128k | -5.6% | -42.1% | -0.7% | 1.4% |
>>> | 256k | -4.9% | -41.9% | -0.4% | 1.9% |
>>>
>>> 64K looks like the clear sweet spot to me.
I'm sorry about this; I've concluded that these tests are flawed. While I'm
correctly setting the MAX_ORDER_UNHINTED value in each case, this is run against
a 4K base page kernel, which means that it's arch_wants_pte_order() return value
is order-4. So for MAX_ORDER_UNHINTED = {64k, 128k, 256k}, the actual order used
is order-4 (=64K):
order = max(arch_wants_pte_order(), PAGE_ALLOC_COSTLY_ORDER);
if (!hugepage_vma_check(vma, vma->vm_flags, false, true, true))
order = min(order, ANON_FOLIO_MAX_ORDER_UNHINTED);
So while I think we can conclude that the performance improves from 4k -> 64k,
and the peak memory is about the same, we can't conclude that 64k is definely
where performance gains peak or that peak memory increases after this.
The error bars on the memory consumption are fairly big.
I'll rework the tests so that I'm actually measuring what I was intending to
measure and repost in due course.
next prev parent reply other threads:[~2023-08-02 9:05 UTC|newest]
Thread overview: 53+ messages / expand[flat|nested] mbox.gz Atom feed top
2023-07-26 9:51 [PATCH v4 0/5] variable-order, large folios for anonymous memory Ryan Roberts
2023-07-26 9:51 ` [PATCH v4 1/5] mm: Non-pmd-mappable, large folios for folio_add_new_anon_rmap() Ryan Roberts
2023-07-26 9:51 ` [PATCH v4 2/5] mm: LARGE_ANON_FOLIO for improved performance Ryan Roberts
2023-07-26 16:41 ` Yu Zhao
2023-07-27 4:31 ` Yu Zhao
2023-07-28 10:13 ` Ryan Roberts
2023-08-01 6:36 ` Yu Zhao
2023-08-01 23:30 ` Yin Fengwei
2023-08-02 8:02 ` Ryan Roberts
2023-08-02 9:04 ` Ryan Roberts [this message]
2023-08-02 13:51 ` Yin, Fengwei
2023-08-03 8:05 ` Yin Fengwei
2023-08-03 8:21 ` Ryan Roberts
2023-08-03 8:37 ` Yin Fengwei
2023-08-03 9:32 ` Ryan Roberts
2023-08-03 9:58 ` Yin Fengwei
2023-08-03 10:27 ` Ryan Roberts
2023-08-03 10:54 ` Yin Fengwei
2023-08-04 0:28 ` Yu Zhao
2023-08-01 6:18 ` Yu Zhao
2023-08-02 9:33 ` Ryan Roberts
2023-08-02 21:05 ` Yu Zhao
2023-08-03 10:24 ` Ryan Roberts
2023-08-03 12:43 ` Ryan Roberts
2023-08-03 14:21 ` Kirill A. Shutemov
2023-08-04 0:19 ` Yu Zhao
2023-08-04 2:16 ` Zi Yan
2023-08-04 3:35 ` Yu Zhao
2023-08-04 9:06 ` Ryan Roberts
2023-08-04 18:53 ` Yu Zhao
2023-08-07 19:00 ` Ryan Roberts
2023-08-03 23:50 ` Yu Zhao
2023-08-04 8:27 ` Ryan Roberts
2023-08-04 20:23 ` David Hildenbrand
2023-08-04 21:00 ` Yu Zhao
2023-08-04 21:13 ` David Hildenbrand
2023-08-04 21:26 ` Yu Zhao
2023-08-04 21:30 ` David Hildenbrand
2023-08-04 21:58 ` Zi Yan
2023-08-05 2:50 ` Yin, Fengwei
2023-08-07 17:45 ` Ryan Roberts
2023-08-07 18:10 ` Zi Yan
2023-08-08 9:58 ` Ryan Roberts
2023-08-07 5:24 ` Yu Zhao
2023-08-07 19:07 ` Ryan Roberts
2023-08-07 23:21 ` Yu Zhao
2023-08-08 9:37 ` Ryan Roberts
2023-08-08 17:57 ` Yu Zhao
2023-08-08 18:12 ` Yu Zhao
2023-08-09 16:08 ` Ryan Roberts
2023-07-26 9:51 ` [PATCH v4 3/5] arm64: mm: Override arch_wants_pte_order() Ryan Roberts
2023-07-26 9:51 ` [PATCH v4 4/5] selftests/mm/cow: Generalize do_run_with_thp() helper Ryan Roberts
2023-07-26 9:51 ` [PATCH v4 5/5] selftests/mm/cow: Add large anon folio tests Ryan Roberts
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=951a8d96-ecdf-7ca4-ec7a-e1c5eba8bce3@arm.com \
--to=ryan.roberts@arm.com \
--cc=akpm@linux-foundation.org \
--cc=anshuman.khandual@arm.com \
--cc=catalin.marinas@arm.com \
--cc=david@redhat.com \
--cc=fengwei.yin@intel.com \
--cc=itaru.kitayama@gmail.com \
--cc=linux-arm-kernel@lists.infradead.org \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=mcgrof@kernel.org \
--cc=shy828301@gmail.com \
--cc=will@kernel.org \
--cc=willy@infradead.org \
--cc=ying.huang@intel.com \
--cc=yuzhao@google.com \
--cc=ziy@nvidia.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox