From: Baolin Wang <baolin.wang@linux.alibaba.com>
To: David Hildenbrand <david@redhat.com>, akpm@linux-foundation.org
Cc: arnd@arndb.de, jingshan@linux.alibaba.com, linux-mm@kvack.org,
linux-arch@vger.kernel.org, linux-kernel@vger.kernel.org
Subject: Re: [RFC PATCH] mm: Introduce new MADV_NOMOVABLE behavior
Date: Thu, 20 Oct 2022 15:15:26 +0800
Message-ID: <70610ea1-5932-a19f-5eba-c4fba06335da@linux.alibaba.com>
In-Reply-To: <470dc638-a300-f261-94b4-e27250e42f96@redhat.com>

On 10/19/2022 11:17 PM, David Hildenbrand wrote:
>> I observed one migration failure case (not easy to reproduce) in which
>> both the 'thp_migration_fail' count and the 'thp_split_page_failed'
>> count are 1.
>>
>> That means a THP in the CMA area was being migrated, but a new THP
>> could not be allocated due to memory fragmentation, so the THP was
>> split instead. However, the THP split also failed, probably because of
>> a temporary reference count on this THP. The temporary reference count
>> can be caused by dropping page caches (I observed the drop caches
>> operation in the system), but the shmem page caches could not be
>> dropped because they were already dirty at that time.
>>
>> So can we try again in migrate_pages() if the THP split fails, to
>> mitigate the migration failure, especially when the failure reason is
>> a temporary reference count? Does this sound reasonable to you?
>
> It sounds reasonable, and I understand that debugging these issues is
> tricky. But we really have to figure out the root cause to make these
> pages that are indeed movable (but only temporarily not movable for
> reason XYZ) movable.
>
> We'd need some indication to retry migration longer / again.
OK. Let me try this and see if there are other possible failure cases in
the products.
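
As a rough illustration of the retry idea discussed above, here is a
user-space toy model in Python. It is not kernel code and not the actual
migrate_pages() logic: ToyPage, try_split_and_migrate() and
migrate_with_retry() are hypothetical names, and the attempt limit of 10
is an arbitrary value chosen for the sketch. It only shows that a failure
caused by a temporary reference can succeed on a later attempt once the
reference is gone.

```python
from typing import Callable


class ToyPage:
    """Hypothetical stand-in for a THP that holds temporary extra references."""

    def __init__(self, transient_refs: int) -> None:
        self.transient_refs = transient_refs

    def try_split_and_migrate(self) -> bool:
        # While a temporary reference is held, the split (and thus the
        # migration) fails. Dropping one reference per attempt models the
        # holder releasing it between attempts.
        if self.transient_refs > 0:
            self.transient_refs -= 1
            return False
        return True


def migrate_with_retry(try_migrate: Callable[[], bool],
                       max_attempts: int = 10) -> int:
    """Return the attempt number that succeeded, or 0 if every attempt failed."""
    for attempt in range(1, max_attempts + 1):
        if try_migrate():
            return attempt
    return 0


if __name__ == "__main__":
    page = ToyPage(transient_refs=2)
    print(migrate_with_retry(page.try_split_and_migrate))
```

The bounded loop matters: a page that is pinned for a long time still
fails after max_attempts, so retrying only helps for short-lived
references.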
>>
>> However, I am still worried that other cases could cause migration
>> failure, so avoiding CMA allocation in our case seems more stable
>> IMO.
>
> Yes, I can understand that. But as one example, your approach doesn't
> handle the case that a page that was allocated on !CMA/!ZONE_MOVABLE
> would get migrated to CMA/ZONE_MOVABLE just before you would try pinning
> the page (to migrate it again off CMA/ZONE_MOVABLE).
Indeed, as you said before, it only helps to minimize page migration for
now. Maybe I can take MADV_PINNABLE into consideration when allocating
new pages, such as in alloc_migration_target().
Anyway, let me try to fix the root cause first to see if it can solve our
problem.
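
To check whether the two counters mentioned earlier move during a test
run, a small script along these lines may help. It is only a sketch:
parse_vmstat() and thp_failure_delta() are hypothetical helper names, and
it assumes the running kernel exposes 'thp_migration_fail' and
'thp_split_page_failed' in /proc/vmstat (a missing counter is treated
as 0).

```python
from typing import Dict

# The two counters named in this thread.
THP_COUNTERS = ("thp_migration_fail", "thp_split_page_failed")


def parse_vmstat(text: str) -> Dict[str, int]:
    """Parse '<name> <value>' lines, as found in /proc/vmstat, into a dict."""
    counters: Dict[str, int] = {}
    for line in text.splitlines():
        parts = line.split()
        if len(parts) == 2 and parts[1].isdigit():
            counters[parts[0]] = int(parts[1])
    return counters


def thp_failure_delta(before: str, after: str) -> Dict[str, int]:
    """Return how much each THP failure counter grew between two snapshots."""
    b = parse_vmstat(before)
    a = parse_vmstat(after)
    return {name: a.get(name, 0) - b.get(name, 0) for name in THP_COUNTERS}


def read_vmstat(path: str = "/proc/vmstat") -> str:
    """Read one snapshot from the given file (Linux only)."""
    with open(path) as f:
        return f.read()
```

Take one snapshot with read_vmstat() before the workload and one after,
then pass both to thp_failure_delta(); a non-zero delta for both counters
matches the failure case described above.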
> We really have to fix the root cause.
OK. Thanks for your input.