From: David Hildenbrand <david@redhat.com>
To: Matthew Wilcox <willy@infradead.org>
Cc: Ryan Roberts <ryan.roberts@arm.com>,
Andrew Morton <akpm@linux-foundation.org>,
"Kirill A. Shutemov" <kirill.shutemov@linux.intel.com>,
Yin Fengwei <fengwei.yin@intel.com>, Yu Zhao <yuzhao@google.com>,
Catalin Marinas <catalin.marinas@arm.com>,
Will Deacon <will@kernel.org>,
Anshuman Khandual <anshuman.khandual@arm.com>,
Yang Shi <shy828301@gmail.com>,
linux-arm-kernel@lists.infradead.org,
linux-kernel@vger.kernel.org, linux-mm@kvack.org
Subject: Re: [PATCH v2 0/5] variable-order, large folios for anonymous memory
Date: Fri, 7 Jul 2023 15:24:33 +0200
Message-ID: <bfa13f35-bca9-c4e8-25f3-e8021f85f223@redhat.com>
In-Reply-To: <ZKgPIXSrxqymWrsv@casper.infradead.org>
On 07.07.23 15:12, Matthew Wilcox wrote:
> On Fri, Jul 07, 2023 at 01:40:53PM +0200, David Hildenbrand wrote:
>> On 06.07.23 10:02, Ryan Roberts wrote:
>> But can you comment on the page migration part (IOW did you try it already)?
>>
>> For example, memory hotunplug, CMA, MCE handling, compaction all rely on
>> page migration of something that was allocated using GFP_MOVABLE to actually
>> work.
>>
>> Compaction seems to skip any higher-order folios, but the question is if the
>> underlying migration itself works.
>>
>> If it already works: great! If not, this really has to be tackled early,
>> because otherwise we'll be breaking the GFP_MOVABLE semantics.
>
> I have looked at this a bit. _Migration_ should be fine. _Compaction_
> is not.
Thanks! Very nice if at least ordinary migration works.
>
> If you look at a function like folio_migrate_mapping(), it all seems
> appropriately folio-ised. There might be something in there that is
> slightly wrong, but that would just be a bug to fix, not a huge
> architectural problem.
>
> The problem comes in the callers of migrate_pages(). They pass a
> new_folio_t callback. alloc_migration_target() is the usual one passed
> and as far as I can tell is fine. I've seen no problems reported with it.
>
> compaction_alloc() is a disaster, and I don't know how to fix it.
> The compaction code has its own allocator which is populated with order-0
> folios. How it populates that freelist is awful ... see split_map_pages()

Yeah, all that code was written under the assumption that we're moving
order-0 pages (which is what the anon+pagecache pages were).

From what I recall, we're allocating order-0 pages from the high memory
addresses, so we can migrate from low memory addresses, effectively
freeing up low memory addresses and filling high memory addresses.

Adjusting that will be ... interesting. Instead of allocating order-0
pages from high addresses, we might want to allocate "as large as
possible" ("grab what we can") from high addresses and then have our own
kind of buddy for allocating from that pool a compaction destination
page, depending on our source page. Nasty.

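To make the "grab what we can" idea a bit more concrete, here is a minimal
userspace sketch of such a private buddy-style pool. It is not kernel code
and not a proposed implementation: the names dst_pool, pool_add, pool_alloc,
POOL_MAX_ORDER and POOL_MAX_BLOCKS are all hypothetical, blocks are tracked
as plain PFN numbers, and freed blocks are never merged back. It only shows
the splitting step: take the smallest free block that fits the source
folio's order and hand the unused upper halves back to the pool.

```c
#include <assert.h>

#define POOL_MAX_ORDER  10   /* hypothetical cap on block order */
#define POOL_MAX_BLOCKS 64   /* hypothetical per-order capacity */

/* One small stack of free block start PFNs per order (illustrative only). */
struct dst_pool {
	unsigned long pfn[POOL_MAX_ORDER + 1][POOL_MAX_BLOCKS];
	int nr[POOL_MAX_ORDER + 1];
};

/* Add a free block of 2^order pages starting at pfn; 0 on success. */
static int pool_add(struct dst_pool *p, unsigned long pfn, int order)
{
	assert(p);
	if (order < 0 || order > POOL_MAX_ORDER)
		return -1;
	if (p->nr[order] == POOL_MAX_BLOCKS)
		return -1;
	p->pfn[order][p->nr[order]++] = pfn;
	return 0;
}

/*
 * Allocate a destination block of 2^order pages, splitting a larger
 * block if no block of the exact order is free. Returns 0 and stores
 * the start PFN in *pfn on success, -1 if nothing suitable is free.
 */
static int pool_alloc(struct dst_pool *p, int order, unsigned long *pfn)
{
	unsigned long base;
	int o, i;

	assert(p && pfn);
	if (order < 0 || order > POOL_MAX_ORDER)
		return -1;

	/* Find the smallest order >= requested that has a free block. */
	for (o = order; o <= POOL_MAX_ORDER; o++)
		if (p->nr[o] > 0)
			break;
	if (o > POOL_MAX_ORDER)
		return -1;

	/* Make sure every split remainder has room before touching state. */
	for (i = order; i < o; i++)
		if (p->nr[i] == POOL_MAX_BLOCKS)
			return -1;

	base = p->pfn[o][--p->nr[o]];

	/* Split down: keep the lower half, return the upper half. */
	while (o > order) {
		o--;
		p->pfn[o][p->nr[o]++] = base + (1UL << o);
	}

	*pfn = base;
	return 0;
}
```

With a single order-4 block at PFN 1024 in the pool, an order-2 request
returns PFN 1024 and leaves an order-3 block at 1032 and an order-2 block at
1028 behind; a following order-0 request then splits the block at 1028.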
What should always work is the split->migrate. But that's definitely not
what we want in many cases.
>
>> Is swapping working as expected? zswap?
>
> Suboptimally. Swap will split folios in order to swap them. Somebody
> needs to fix that, but it should work.
Good!

It would be great to have some kind of a feature matrix that tells us
what works perfectly, sub-optimally, barely, not at all (and what has
not been tested). Maybe (likely!) we'll also find things that are
sub-optimal for ordinary THP (like swapping, though I'm not even sure
about that).

I suspect that KSM should work mostly fine with flexible-thp. When
deduplicating, we'll simply split the compound page and proceed as
expected. But it might be worth testing as well.

--
Cheers,
David / dhildenb